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The density of states of disordered hopping models generically exhibits an essential singularity 
around the edges of its support, known as a Lifshitz tail. We study this phenomenon on the Bethe 
lattice, i.e. for the large-size limit of random regular graphs, converging locally to the infinite regular 
tree, for both diagonal and off-diagonal disorder. The exponential growth of the volume and surface 
of balls on these lattices is an obstacle for the techniques used to characterize the Lifshitz tails in 
the finite-dimensional case. We circumvent this difficulty by computing bounds on the moments of 
the density of states, and by deriving their implications on the behavior of the integrated density of 
states. 

< 

The seminal work of Anderson [l[ has given birth to a vast body of literature on the properties of transport in random 
environments, both in physics (see for a recent review) and in mathematics (monographs include for instance 0- 
One central question in this domain is to determine whether a particle can diffuse freely in the environment, 
according to the dimensionality of the model, the intensity of the disorder, and the energy of the particle. This 
question can be rephrased in terms of the spreading of eigenvectors of the Hamiltonian corresponding to this energy, 
or more mathematically in terms of the nature (i.e. absolutely continuous versus pure-point) of its spectrum. Another 
direction of investigation of these models concerns the density of states Q of such random Hamiltonians, roughly 
speaking the distribution of eigenvalues, irrespectively of the nature (localized or extended) of the corresponding 
eigenvectors. Even though this quantity does not reflect directly the localization properties of the Hamiltonian @,H|], 
I 1 its study by Lifshitz Q revealed very early an interesting behavior around the edge of the spectrum, namely a very 
fast vanishing following an essential singularity known as a Lifshitz tail. This behavior can be explained intuitively 
as follows. In dimension d, an eigenvector of the non-disordered Hamiltonian with an eigenvalue close to the band 
edge, say at a very small distance S from it, is supported on a volume of order 5~ d / 2 . In presence of disorder of 
i bounded amplitude, this vector can give rise to an eigenvalue at an energy S lower than the (displaced) band edge 

£ max only if the random potentials on all sites in this volume are close to their maximal value. The probability of 
this event is exponentially small in the number of involved sites, hence the form of the Lifhsitz tail in dimension d, 
p(E max — 5) ~ exp[— <5~ <i / 2 ]. This heuristic reasoning can be turned into a rigorous derivation, see for instance chapter 
VI.2 in [p and [Icj for an illuminating exposition. Note also that even if the density of states is not directly related to 
\^ ■ the localization aspects of the problem, proofs of localization in finite dimension based on the multi-scale analysis 11 1 
iy-j | rely crucially on its estimates. 

It has been realized early on [l2|, EH that the Anderson model could also be studied on the Bethe lattice, thus 
enabling a mean-field analysis of the localization transition (that does not exist on the fully-connected, complete 
graph). Since then this version of the model has been the subject of several works both in physics |L4l - [l9| and in 
mathematics [20|-[27|. Among other results these papers contain numerical procedures to compute the density of 
states and the location of the localization transition fij, EH, EMI, as well as proofs of the existence of an absolutely 
continuous part of the spectrum at low disorder |23l425| . and of localization at large disorder or on the border of the 
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spectrum [22|. The study of sparse random matrices (28l - l34j is actually closely connected to the Anderson problem 
on the Bethe lattice, even though the perspective taken is slightly different; in the latter case the disorder appears 
through the connectivity properties of the underlying random graph with fluctuating degrees. 

We shall focus in this paper on the Lifshitz tail phenomenon for the Bethe lattice geometry. Heuristic arguments put 
forward for instance in |14| . fl9| and a rigorous analysis for a particular type of disorder [35l |36| suggest an even more 
violent behavior of the density of states in this regime, namely a vanishing of the form p(E max — S) ~ exp[— exp[(5~ 1 / 2 ]], 
that makes this regime very hard to study numerically. As a first example we show in Fig. [1] a plot of the density 
of states obtained by a standard numerical procedure recalled in Appendix [Al which displays an apparent band edge 
far from its exactly known value. This different form (doubly-exponential) of the Lifshitz tail with respect to the 
finite-dimensional case can be associated to the exponential (instead of polynomial) growth of the volume of a ball as 
a function of its radius on the Bethe lattice (formally corresponding to d — > oo). In addition the surface of a ball is 
asymptotically equivalent to its volume on such a non-amenable graph. This strongly complicates the transposition of 
the scheme of proofs of Lifshitz tails from the fmite-ti case to the Bethe lattice one, and indeed rigorous results on the 
Lifshitz tail behavior on Bethe lattices or sparse random graphs are restricted to sub-critical percolation models [35l — 
The alternative method developed in this paper to circumvent this difficulty consists in studying the asymptotic 
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behavior of the moments of the density of states. The bounds on the moments that we obtain, and their consequences 
for the integrated density of states, are in agreement with the doubly-exponential form of the Lifshitz tail on the 
Bethe lattice [II M, M, m ■ 

The rest of the paper is organized as follows. In Sec. [TT] we define more precisely the models under study and we 
state our main results. In Sec. IHII we collect several expressions of the moments of the density of states. The following 
two sections pVI and [Vj present the proofs of our results for the off-diagonal and diagonal disorder case, respectively. 
Each of these two sections is divided into two subsections focusing on lower and upper bounds on the moments of 
the density of states. Finally we draw our conclusions and propose perspectives for future work in Sec. IVII More 
technical aspects of our work are collected in a series of Appendices. 
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FIG. 1: Density of states on the Bethe lattice of degree k + 1 = 4, in the pure (Jij = 1) and disordered case (with off- 
diagonal disorder Jij uniformly distributed on [—1,1], see below for a precise definition). The pure case result is the Kesten- 
McKay [3fij, H|| density given in Eq. @. The curve of the disordered case was obtained following the numerical procedure 
recalled in Appendix [A] The support of these two distributions is the same, despite the apparent vanishing of the disordered 
one due to the strong Lifshitz tail effect. 



II. DEFINITIONS AND RESULTS 

In this section we shall define the density of states of Anderson models on the Bethe lattice, and state our main 
results on the asymptotic growth of its moments and their counterparts on the decay of the integrated density of states 
near the edge of the spectrum. The (integrated) density of states is usually defined, for finite-dimensional models 
considered on the Z d lattice, via a limiting procedure from finite boxes [—L,L] d with L — > oo. The corresponding 
construction on an infinite regular tree, namely cutting a depth L neighborhood around an arbitrarily chosen vertex, 
is not satisfactory because the number of vertices on the surface of a ball of radius L is of the same order as the one 
of its interior. Finite-size regularizations of the Bethe lattice are instead provided by random regular graphs (see for 
instance [40] for such a discussion in the context of spin-glasses). We shall hence begin this section by defining both 
finite (in Sec. Ill A| ) and infinite (in Sec. Ill B| ) versions of the Bethe lattice models, and highlight the connection between 
the two. We will in particular define the density of states and its moments, and discuss its regularity properties in 
Sec lII CI Our main results on the asymptotic growth of the moments of the density of states are given in III E( to make 
them more intuitively understandable we discuss just before, in Sec. Ill Dt how the behavior of a probability density 
close to its edge is reflected in the type of growth of its moments. 
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disordered case 
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A. Anderson models on random regular graphs 



Let us consider a finite graph Q — (V,£) on N vertices, and aJVxJV symmetric matrix H defined by its matrix 
elements 



The real numbers Vi (resp. Jy = Jji) represent the influence of the disorder on the vertex i (resp. on the edge 
between i and j). In the pure case (without disorder), i.e. when Vi = and Jy = I for all vertices and edges, H is 
nothing but the adjacency matrix of the graph. This well-known case will be considered for comparison purposes in 
the following. 

For a given realization of the Hamiltonian matrix H its empirical spectral measure is defined as a sum of Dirac 
atoms on its eigenvalues Ai, . . . , Ajv' 



We turn this measure into a random object by taking for Q a random k + 1-regular graph (i.e. choosing it uniformly 
at random among all graphs on N vertices, where all vertices have degree k + 1), for the diagonal elements V, a set of 
independent identically distributed (i.i.d.) random variables, and another set of i.i.d. random variables for the Jy's 
with i < j. The probability measure associated to this construction will be denoted P[-], and corresponding averages 
as E[-]. In the following we will concentrate mostly on two particular cases: 

• the off-diagonal (or bond) disorder case, in which all the Vi vanishes. 

• the diagonal (or site) disorder case, in which all the Jy are equal to 1. 

In the former case we shall make the following assumptions on the random edge couplings: 

• (OD1) J has a support included in [—1,1], with |J| not always equal to 1, but which takes values arbitrarily 
close to 1 with positive probability. In technical terms, we assume the existence of J m i n > such that < 
P[\J\ > M < 1 for all J with J min < J < 1. 

• (OD2) Va > , a e (0, 1/2) , | lnP(J > 1 - e)| e - Q / £ ° = 0(e) when e ->■ 0. 
The corresponding assumptions for the random energies Vi in the latter case will be: 

• (Dl) V has a support included in [0, W], distinct from {W}, giving positive weight to a neighborhood of W. 

• (D2) Va > , (16(0,1/2), | lnP(V > W - e)\e~ a ^ a = 0(e) when e -> 0. 

The simplest examples of random variables satisfying these assumptions are the uniformly random ones (on [— 1 , 1] 
for J and on [0, W] for V), or the Bernoulli random variable which corresponds, in the off-diagonal case, to the bond 
percolation model on the Bethe lattice. The assumptions (OD2) and (D2) imply that the occurrence of a Jij (or Vi) 
close to its maximal value is not by itself a very rare event, otherwise the Lifshitz tail phenomenon would be controlled 
by single site large deviations and would not be a collective effect anymore, as will become clear in Sec. IIV AI and 
Sec. IV Al Note that the assum ptio n (Dl) does not hold for the Cauchy and Gaussian distributions of diagonal disorder 
studied for instance in [l6|, l2ll. |26|. |27| . The choice of a positive support for the diagonal disorder will greatly simplify 
the proofs, without loss of generality: adding a constant value to the V^s only shifts the support of pjy. 

The local convergence of random regular graphs to infinite trees ensures [4l[ that pn converges (weakly, in distribu- 
tion) in the N — ¥ oo limit to a probability measure p, that we shall call, following the physics convention with a slightly 
abusive terminology discussed in the next subsections, the density of states. For instance in the pure case where H 
is the adjacency matrix of a random (k + l)-regular graph, p is the Kesten-McKay measure (38l . [3^ |. supported on 
[— 2y/k, 2\/k\ with the density 




(1) 




(2) 



Pkm(E) 



fe+1 V4fc - E 2 
2tt (k + l) 2 -E 2 ' 



(3) 



4 



In the next subsection we shall see how to define the measure p directly on an infinite object without this N — >• oo 
limit. Before that we define the average moments of the empirical spectral measure, 

= lE[tr J ff»]=E[(ff") 00 ] , (4) 

where in the last step we have used the invariance of the random graph ensemble with respect to the permutations of 
the vertices and denoted the index of an arbitrary vertex in V. We shall use the symbol u lly N for a generic disorder 
distribution, and denote instead these average moments c n ,N (resp. d n ^) for the off-diagonal (resp. diagonal) disorder 
case. We define u n (resp. c n , d n ) as the N — > oo limit of it^jv (resp. c ni N, d n ^)- The above stated convergence 
of pn imply that this limit exists and that the u n are the moments of the limiting measure, the density of states p 
(a "self-averaging" property). The existence of the limit for the average moments can in fact be obtained in a more 
direct way: expanding (H n )ao in the last expression of Eq. (j4]) shows that u n ,jv depends only on the disorder in the 
subgraph neighboring the reference vertex within a distance at most n/2, which converges in the N — > oo limit to 
a regular tree. 
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B. Anderson models on infinite trees 



It is also possible to define Anderson models directly on infinite graphs, that we keep denoting Q — (V, £). The 
infinite counterpart of the matrix H becomes an operator acting on the elements ip of the Hilbert space H = £ 2 (V) as 

{Htp)i = ^2 J v fj +ViVi i (5) 

jEdi 

where di stands for the set of neighbors of i in the graph Q. When the variables Jij = Jji on the edges of £, the 
ViS on the vertices, and the degrees \di\ of the vertices, are all uniformly bounded, then H is a self-adjoint operator. 
For the convenience of the physicist reader who wants to immerse him/herself in the mathematical literature we shall 
sketch in an informal way some of the concepts used in the mathematical literature on localization, even though this 
is not the main subject of the paper. Exhaustive reviews of this subject can be found in the monographs [3H5J and in 
the lecture notes |42j |. 

A major difficulty in dealing with infinite dimensional Hilbert space is that some of the solutions of the eigenvector 
equation Hp = Ecp are not normalizable, i.e. not in £ 2 (V). The spectrum <j(H) has thus to be defined as the 
complement of the resolvent set. The latter is the set of complex numbers z such that the resolvent operator (or Green 
function) (H — zIY^ exists and is bounded (I denotes the identity operator). When H is self-adjoint o{H) C M. The 
spectral theorem [43[ is the extension to infinite dimensional self-adjoint operators of the diagonalization of Hcrmitian 
matrices. It asserts that H can be written as 

H = J X dp(X) , (6) 

where fx is a projection valued measure providing a resolution of identity: for all (Borel) subsets / of M, p{I) is an 
orthogonal projection. In the finite dimensional case /x(/) would project on the subspace spanned by eigenvectors 
associated to eigenvalues E *E I. From this projection valued measure one can define the spectral measures, which 
are real measures associated to elements p £ H according to (J> V (I) = (p\p,(I)\p) . Any real measure r] can be 
decomposed [44| in three contributions of different types, rj = r/ pp + ?7 ac + ?7 SC , where r] pp is the pure point part 
of r\ (its set of Dirac peaks), r) ac its absolutely continuous part (which gives no weight to sets of zero Lebesgue 
measure, and has a density thanks to the Radon-Nikodym theorem), and the remainder ?7 SC is the singular continuous 
contribution. Combining this decomposition of real measures with the definition of the spectral measures fx v leads 
to a partition of the Hilbert space as H = H ac © T-L pp © H sc , where Hi is defined, for i = pp, ac, sc, as "Hi = {p € 
T-L | p, v is purely of type i}. Each of these subspaces is stable under the action of H, one can thus define the three 
spectra <Ji(H) as a(H f Hi), where H \ Hi denotes the restriction of H to Hi. In general a pp (H), a ac (H) and a sc (H) 
are not disjoint. The pure-point spectrum contains the "true" eigenvalues, corresponding to localized (normalizable) 
states, while the absolutely continuous spectrum is associated to generalized eigenvalues and extended states. This 
classification is reflected in the dynamical evolution of the system: a wavepacket constructed from states in the 
absolutely continuous spectrum spread out at infinity, while pure-point ones remain localized in a finite volume. 

This discussion concerned a given operator H; when the coupling constants Jij and on-site energies Vi are turned in 
random variables, all the quantities defined above become themselves random. However when the Jij and Vi are inde- 
pendent (or more generally form an ergodic process), it follows from ergodicity that the spectrum <r{H) is equal, with 
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probability 1, to a deterministic set independent of the random variables Jy and Vi. Moreover this set is equal to the 
spectrum of a (pure) model where all random variables are fixed deterministically to their extremal values. A stronger 
ergodicity statement is actually true: the spectra a pp (H), a ac {H) and a sc (H) are also independent, with probability 
1, of the actual realization of the disorder. One of the main goal of the localization studies is the characterization 
of these sets, depending on the underlying graph and the distribution of the disorder. In unidimensional models the 
spectrum is completely pure-point for infinitesimal amounts of diagonal disorder [45[ , while in large dimensions there 
exists a mobility edge separating the extended and localized [n], [22[ parts of the spectrum. 

The "density of states" p is defined as the average of the spectral measure associated to an element <p £ supported 
on a single arbitrary site £ V, tpi — 5i_o, that we shall denote 

p = E[<0|/*|0>] • (7) 

Its denomination is a bit misleading from a mathematical point of view: p is a probability measure, a priori not 
absolutely continuous and in consequence not associated to a density (we shall come back to this point shortly 
afterwards). The mathematical literature calls instead integrated density of states (IDS) the cumulative distribution 
function N(E) = p((—oo,E}) which always exists. In finite dimensions (i.e. for V = Z d ) the IDS can be constructed 
as follows: call H L the restriction of the operator H to a box of volume L d with some boundary conditions. H L acts 
on a finite-dimensional space and can thus be diagonalized. Let N L (E) be the fraction of its eigenvalues smaller than 
or equal to E; it can then be proven that N L converges as L is sent to infinity to the (deterministic) distribution 
function N defined above directly from the infinite-size operator. Another general result on the IDS is that its support 
coincides with the (almost sure) spectrum cr(H). 

Let us now come back to the main subject of the paper and specialize some of the definitions above to the (infinite) 
Bethe lattice. In that case Q is the regular tree in which every vertex has k + 1 neighbors, sketched in the left panel of 
Fig. [5] and denoted Tfe in the following. We shall also use the rooted fc-ary tree Tfe, in which there is a distinguished 
vertex (the root) that has no parent node, and every vertex has exactly k children, see right panel of Fig. [2l We will 
sometimes also consider Tfe as a rooted tree, choosing an arbitrary vertex as the root, and calling children of a vertex 
V 7^ its k neighbors away from the root. We take the Jy = Jji as a collection of i.i.d. random variables on the edges, 




and similarly with the V.'s another set of i.i.d. random variables. In the pure case ( J\j = 1, Vi = 0) the spectrum of 
H is the interval [— 2y/k, 2\/k]. In the diagonal disorder case (Jy = 1) with the Vi i.i.d. random variables supported 
on [0, W] the ergodicity result mentioned above (see [2l| for a discussion of ergodicity on the Bethe lattice) implies 
that the (almost sure) spectrum of H and the support of p is [—2y/k, 2^/k + W]. Similarly in presence of off-diagonal 
disorder (Jy satisfying (OD1), Vi — 0) the support of p is the same as in the pure case, [-2Vfc,2VE]. Tr ees are m 
some respects much simpler than finite-dimensional lattices, as they admit a natural recursive decomposition. One 
can for instance picture Tfe as made of its root, fc + 1 edges from the root to its neighbors, and k + 1 copies of the 
rooted fc-ary tree Tfe . This decomposition allows to write explicit recursion relations on the diagonal elements of the 
resolvent (Green function). Let us call G(z) = (0\(H — zl) _1 |0) the Green function at the root of Tfe, where z has 
a positive imaginary part. This is a random variable because of the disorder in the and Vi. Using the recursive 
nature of Tfe and resolvent identities (see App.Ofor a more explicit derivation with slightly different notations) one 
can show [H,|4l[ that G(z) obeys the following recursive distributional equation (RDE): 

G(z) = L= = , G(z) = \ ^ . (8) 

V-z- JfGi(z) V-z-TLi J? G i( z ) 

Here = denotes the equality in distribution of random variables, G is the equivalent of G on Tfe instead of Tfe, and 
the Gi (resp. J,, V) are i.i.d. copies of G (resp. J and V the distribution of the off-diagonal and diagonal disorder). 
By definition E[G(z)] is the Stieltjes transform of p, hence the density of the absolutely continuous part of p (see the 
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next subsection for a discussion of the regularity properties of p) can be computed from the solution of the RDE as 

Pac (E) = lim —ImE[G(E + ir))\ . (9) 
rj\0 7r 

The equations ([5| and ([§]) can be solved numerically, and this is how we obtained the curves in Figs. 11131 and [51 We 
give more details on the numerical procedure in App. [A] 

As mentioned in the previous section the density of states p constructed here directly on the infinite regular tree 
coincides with the large size limit of the empirical spectral measures for random regular graphs. At variance with finite- 
dimensional models, the density of states of the Bethe lattice cannot be obtained as the limit of growing subgraphs 
of the infinite tree: the surface of the latter grows as fast as their volume, which makes this construction ill-behaved. 

We defined previously u n as the n'th moment of p. The expression of p that follows from Eqs. (|6l7p indicates that 
these moments can be computed on the infinite tree as u n = E[(0|ff n |0)]. We shall similarly define u n as E[(0|-ff™|0)], 
where H is now restricted on rooted at vertex 0, and denote these moments c n (resp. d n ) in the off-diagonal (resp. 
diagonal) case. 

Let us note that in principle one could prove analytical results on the Lifshitz tail behavior of the density of states 
by characterizing directly the solution of the RDE (JSJ . However it is a difficult task to handle quantitatively this kind 
of equation, this is why we followed the indirect approach via the moments of the density of states in this paper. 

C. Absolute continuity of the density of states 

As already underlined p is in general a probability measure and as such is not guaranteed to be absolutely continuous 
(i.e. to have a density). However additional assumptions on the random variables Jij and Vi besides (OD1-OD2) 
or (D1-D2) are known to imply such a regularity of p (this is why we denoted p{E) instead of p &c (E) the density 
of states in Figs. 11131 and |9")). Let us first discuss the diagonal disorder case. If V has a bounded density, then p 
is absolutely continuous with a bounded density. This is a well-known result for finite-dimensional models, usually 
called a "Wegner estimate" (see Q for the original work and @, |42|, HH for mathematical presentations) . The core 
of the argument in finite dimension goes as follows: the number of eigenvalues of H L (the regularization of H in a 
box of size L) below a fixed threshold can only change by one when the potential Vi at one site is varied from its 
minimal to its maximal value. This is then shown to imply the absolute continuity of the average of N L , and finally 
of p by taking the L — > oo limit. The argument can thus be readily extended to the Bethe lattice case thanks to 
its regularization of finite size N provided by the random regular graphs on N vertices. We stress that the absolute 
continuity of the density of states should not be mistaken with the nature (localized or extended) of the spectrum for 
a given realization of the disorder. 

Wegner's argument does not apply directly to the off-diagonal case, but under the additional assumption that J has 
a Lipshitz continuous density the integrated density of states is locally Lipschitz continuous [47| (hence the density of 
states is almost everywhere defined and bounded), except possibly in the middle of the band (i.e. for E = 0). At this 
energy and in presence of off-diagonal disorder only, the density of states is known to diverge in one dimension [48j . 
The numerical estimates we obtained for the density of states of the Bethe lattice exhibit a weak non-analyticity 
around E = (see Fig. A non-rigorous analysis presented in Appendix [A] suggests the form p(E) ~ p(0) — a\E\^~ 
when E —} 0, which is in very good agreement with the numerical results. 

D. From a probability density to its moments 

In the next subsection we shall present some bounds on the large n behavior of the moments u n of the density of 
states p, that reflects the behavior of p around the edges of its support. To facilitate the intuitive understanding of 
the bounds we consider here the easy direction of the connection between p and u n , that is we recall how the behavior 
of a probability distribution at the border of its support influences the growth of its moments. 

Let us consider a probability density rj(E) supported on [E-,E + ] and its moments u n — E n rj(E)dE. In the 
large n limit the dominant contribution to the integral arises from regions closer and closer to the edges E_ and E+ , 
and thus the growth of the moments is controlled by the behavior of f](E) close to the edge E_ or E + which is largest 
in absolute value. To simplify the discussion let us assume that ij is symmetric and denote Eq = E + = —E-. Then 
one easily finds the following correspondences (n is implicitly even below): 

. If rj(E) ~ (Eq - E) a (a > 0), then i lnu n = ln(E ) - (a + 1)^ + o (^). 
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FIG. 3: Left: density of states around E — in the off-diagonal disorder case for k — 2, 3, 4. To allow for comparison between 
these different values of k the couplings have been rescaled, and taken uniformly distributed on [— \/2/k\. Right: detail 

of the cusp around E — for k — 2 and a fit to the analytic form derived heuristically in Appendix 1X1 



• If rj(E) ~ e ' 3 ^- E ° E ^ " (a,f3> 0), then ^hiu„ = ln(.Eo) — cn "+ 1 + o(n "+ 1 ), with c a constant depending 
on a and /3. 

. Ifry^-e-v^ ^ 5 "" (a, 0, 7 > 0), then I h™ n = ln(E ) - ^(j^r^ + o ({dj^)- 

In all cases the value Eq of the edge of the support controls the dominant (exponential) growth of u n , that is Eq , 
while the behavior of r/ in the neighborhood of Eq yields the subdominant corrections to (lnu n )/n. From one case to 
the next in these three examples r/(E) vanishes faster and faster as E — > Eq, and in consequence the corrections to 
(lnu„)/n decay more and more slowly as n — > oo. Note that in the last two cases the vanishing of p is fast enough 
for the integrated density of states N(E) to behave in an equivalent way at the leading order. 

The Kesten-McKay distribution of Eq. © falls in the scope of the first case with Eq — 1\fk and a = 1/2, so that 
i In u n - ln(2 y/fc) ~ -f 1 ^ (which agrees with the exact expression for the moments obtained by McKay [39[ , recalled 
in Sec. [TiTOl :. 

As explained in the introduction one expects a doubly-exponential form of the Lifshitz tail for the density of states 
on Bethe lattices, corresponding to the third case above. This has been established in a rigorous way in [3^, [36| for 
sub-critical bond percolation on the Bethe lattice, under the form 

lim lnm|m(l N{Eo -8))\ = _l 
s^o In S 2 

where N(E) is the integrated density of states. Its derivative p(E) should behave in the same way (this has been 
proven in finite dimensions in [49j]), and in consequence one should expect the correction terms in (lnti„)/n to be 
of order l/(lnn) 2 . More precisely, the heuristic reasoning presented in [l4|, EH suggests (for off-diagonal disorder) 

I— I fc+l \ 1 • 7rfc 1 / 4 (2v / fc-E)~ 1 / 2 i~ 

p(E) ~ (2Vk — E)y k ' , hence one can apply the formula above with Eq = 2y/k, a — 1/2 and 

(3 = 77 k 1 / 4 In k and make also a prediction for the coefficient of the correction term, i lnc„ — ]n{2y/k) ~ _ (^ h ^ k ) ^ ^ . 
The prediction of [l9j for diagonal disorder has the same form with Eq = 2\fk + W, a = 1/2 and /3 = Trfc 1 / 4 In A;, 

>lnfc) 2 \/fc 1 
2Vk+W (Inn) 2 ' 



which translates into i lnd n - ln(2\/fc + W) ^kf^E^_ 



E. Bounds on the moments and the integrated density of states 

Let us announce here our main results, the proofs being postponed to sections IIVI and IVl We recall that the random 
variables defining the model are assumed to follow the assumptions (OD1-OD2) in the off-diagonal disorder case (in 
particular J„ has support [—1, 1]) and (D1-D2) in the diagonal disorder case (V* is supported on [0, W]). 

a. Off-diagonal disorder For all e, x > 0, there exists Uq such that for all even n > no.' 

jTrlnfc) 2 1 1, , .„ rr. 1 (tt In k) 2 1 
- (1 + eP — T , -rr < - nc„ - n(2Vfc < — ^ '—- ^ . (11 

V ' 2 (Inn) 2 ~ n y ' ~ hn 1 2 ( nn) 2 V ' 
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b. Diagonal disorder For all e, x > 0, there exists uq such that for all n> n$: 

Jirlnk) 2 2y/k 1 1 , , , , rr s 1 (vrln/s) 2 2*fk 1 . , 

- (1 + e)- — t- 2 y=- ^ < -lnrf„ - n(2Vfc + WO < -7; ^- — tt 1 7=- n ^ ■ ( 12 ) 

V ' 2 (2Vk + W) Qnn) 2 ~ n V 7 " (Inn)- 2 (2^fc + M^) (Inn) 2 V y 

Let us first precise the level of rigor of these results: the proofs of the lowerbounds given in Sec. II V Al and Sec. IV Al 
are mathematically rigorous. The upperbounds rely on an explicit computation that we did not turn in a rigorous 
derivation, but that we checked numerically with a very high accuracy, as discussed more precisely in Sec. IIV Bl and 
Sec. EE 

Note also that in the statement of the upperbounds the multiplicative constant is actually irrelevant because of the 
condition x > 0; we chose however to write them in this suggestive form because we conjecture that the leading term 
of the asymptotic expansion corresponds to e = x = 0, i.e. that 

■lncn =ln(2Vfc) - v - ; ro+o , (13) 



n 2 (Inn) 2 \(lnn) 

and a similar conjecture in the diagonal case, in full agreement with the heuristic prediction on p discussed above. 

We now give the implication of these bounds on the moments for the behavior of the density of states itself. The 
support of the latter being bounded, p is unambiguously determined by the knowledge of all its moments [44j . The 
difficulty here is that we only have an asymptotic control on the moments, and that the very slow decay of their 
corrections hampers the use of transfer (tauberian) theorems [3] ■ It is however possible to turn the above statements 
on the moments of p into bounds on the integrated density of states N{E) = p{[— 00, E]): 

c. Off-diagonal disorder For all e, x > 0, and S small enough: 

e _ fe vs- < N ^2Vk + S) = 1 - N(2Vk-5) < e- k y J (14) 

d. Diagonal disorder For all e, x > 0, and S small enough: 

tl /4yn+ir _ fc ^ 1/4 (l)^ 



e 



x <l-N(2Vk + W-5)<e- k W (15) 



The proofs of these two statements are deferred to Appendix [Bj As mentionned in III Al the positivity of the support 
of V in hypothesis (Dl) simplifies the derivation of the bounds on the moments, but is not needed for (|15[) to hold, 
this last statement only requiring that V is bounded and gives positive weight to the neighborhood of its upperbound 
W. Also note that as before the constants on the right-hand sides above are irrelevant due to the freedom in the 
choice of x, and are only here to suggest what we believe should be the asymptotic of In | ln(l — N(Eo — 6))\, with 
Eq = 2\f~k in the off-diagonal disorder case and 2\[k + W in the diagonal disorder case. In fact, a more modest and 
concise statement is: 

lnlnlln(l-^ -^)) l= _l 
«->o In 5 2 ' K ' 

In particular this extends the result of [HI, 36] on the bond percolation model to the percolating phase. 

Finally let us emphasize that in the course of the proof of the lower bounds we shall obtain in a rigorous way that 

lim - In c n = ln(2Vk) , lim - In d n = ln(2\/fe + W) , (17) 

with n even in the first case. This is easily seen to imply (see [39| or App. [5] for more details) that the support of the 
density of states of the off-diagonal (resp. diagonal) disorder model extends up to 2^/k (resp. 2^/k + W), as expected 
from the ergodicity arguments discussed above. 

III. VARIOUS EXPRESSIONS OF THE MOMENTS 
A. Walks on the tree 

From the definition of the model on the (either finite or infinite) Bethc lattice it should be clear that the average 
moments u n (this notation encompassing both the diagonal and off-diagonal disorder cases) can be written as: 

u n = V E[H 0il H ili2 . . . H in _ l0 ] . (18) 
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This is a sum over (lazy) walks of lengths n on T/., starting and ending at its root 0, i.e. over sequences = 
ip, ii, ■ . ■ , i n -li in — of vertices of Tj., such that for all j £ [0, n— 1] either ij = or is an edge of Tj,. In 

the former case, that we shall call a self-bond step around vertex ij, the matrix element is Hi-i j+1 = Vi j . In the latter 
case Hi^i j+1 = Ji jt i j+l . As trees have no cycle each edge is visited an even (possibly null) number of times during a 
closed walk. We denote W n the set of walks of length n around the root 0, and for a walk u> = (ii, . . . , i n -i) £ W n 
we define its weight n as it(lj) = E[Hoi 1 Hi 1 i 2 . . . Hi n _ 1 o], so that u n = XLew„ 7r ( CJ )- A more explicit expression of it 
is obtained by defining n e (ui) as half the number of times the edge e £ £ is crossed during the walk oj, and s v (lj) the 
number of self-bond steps around vertex v £ V. Indeed, as the disorder is given by i.i.d. random variables, 

7t(cj) = Y[ E[J 2n = (w) ] Yl E[V"" (tj) ] . (19) 

In the off-diagonal disorder case only walks without self-bond steps have a non-zero weight; to avoid confusion we 
shall thus denote in this case A4 n the set of walks of length n without self-bond steps. 

It will be useful in the following (in particular in the proofs of the upperbounds in Sec. IIV Bl and fVBj) to refine the 
description of a walk, and to partition the set of walks W n in various subsets. We define the support <j(lj) of a walk 
lj as the set of edges of the tree visited at least once by lj, i.e. <j(lj) = {e £ £\n e (oj) > 1}, and the size of a support 
as the number of edges it contains. A support of size r is a subtree of containing the root and r edges, i.e. a tree 
of r edges where the root has at most k + 1 children, and all other vertices have at most k children. We shall denote 
S r the set of the supports of size r. 

The skeleton a(u>) is defined as the support of w, supplemented by the numbers {w e (w)} e g (T ( tJ ). More generally a 
skeleton a is made of a support a and a set of positive integers {n e } e£a ; we call 2 X^eeo- ne ^ ne l en gth of the skeleton 

cr, and denote <S£ (a) the set of skeletons of length n based on a support a of size r. 

We will also call self-support of a walk lj the set of vertices around which at least one self-bond step is taken, i.e. 
the set {v e V\s v (lj) > 1}. 

The construction of a walk lj £ Ai n , i-e. of length n without self-bond, amounts thus to the successive choices of: 

• a support a of size r < ^. 

• a positive integer n e for each edge e of the support, such that 2^ e(E(j n e = n. This completes the choice of the 
skeleton of the walk. 

• an ordering of the visited edges, that is, a mapping tp from {1, . . . , n} to a compatible with the tree structure 
and the n e . In technical terms, one must have Ve £ <r, |<y3 _1 ({e})| = 2n e and (<p(l), • ■ • , ^(n)) must correspond 
to a walk on the vertices covered by a. 

Note that there are in general several walks compatible with a given skeleton, as explained on a simple example in 
Fig. |U we shall come back to this issue in Sec lIV BIIIVCI 



s s s s 




u u u u 



FIG. 4: A skeleton a of size 2 and length 8, made of the support a — {a, b} and the number of crossings of these two edges, 
n a = 2 and rib = 2. The three vertices involved are denoted {s = 0, t, it}, on the right are drawn the three walks compatible 
with the skeleton (i.e. lj = (s, t, u, t, s, t, u, t, s), lj = (s, t, u, t, u, t, s, t, s) and lj — (s, t, s, t, u, t, u, t, s)), the numbers close to the 
edges being the order in which they are crossed from top to bottom. 

The decomposition given above can be generalized to the case where self-bond steps are allowed. Indeed, the 
definition of such a walk corresponds then to choose: 

• a number s £ [0, n] of self-bond steps (such that n — s is even). 

• a skeleton of length n — s. 

• an ordering of the edges of the support, satisfying the same constraints as given above in absence of self-bond 
steps. This yields a walk lj £ j\A n - s . 
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• s times < ti < ti < . . . t s < n — s, specifying that self-bond steps have to be inserted after t\, £2, • ■ • , t s steps 
of w. 



B. A recursive computation 



1. Recursion relation 



Taking advantage of the recursive structure of one can set up a recursive computation of u n . In this section we 
present the equations and their intuitive interpretation, their formal derivation (which is in essence a series expansion 
of the recursion ([8]) between the Green functions) being deferred to App. [C] Similar recursive equations on the 
moments have been derived in (30L [32j for sparse random matrices, where the randomness lies in the degrees of the 
vertices. 

Consider a closed walk of length n starting at the root of T k , call s the number of self-bond steps made at and 
Pi half the number of times the edge between the root and its neighbor i e [1, k + 1] is crossed. The walk will make a 
number m t of steps in each of the subtrees T^, the copy of rooted at i, such that s + X^( TO « + %Pi) ~ n - One must 
however take into account the fact that these steps are to be divided in pi closed walks on TW, separated by a visit 
to the root 0. In each of these closed walks the disorder encountered in Tw is the same, hence the contribution of 
the various closed walks on T 1 - 1 ' are correlated. Summarizing these observations one obtains the following expression 
of u n , 



u(m 1 ,p 1 )...u(m k+1 ,p k+1 )( S+P )e[J^} . . .E[J 2 p^]E[V s ] , (20) 

\s,pi,...,p k+ ij 



v ' 

s,p 1 ,m 1 ,...,p k + 1 ,m k + 1 >0 
s+m 1 +2p 1 -\ \-m k + 1 +2p k+1 



where in the multinomial coefficient we denoted p = p\ + • • • + Pk+i- This coefficient counts the number of orderings 
of the steps made each time the walk steps out the root, either for a self-bond step or towards one of the k + 1 
neighbors. The quantity u(m,p) is the average contribution of the steps made in one of the subtrees T", when m 
steps are performed in , divided in p epochs separated by visits to the ancestor 0. A more explicit interpretation 
of u(m,p) is the following: introduce an additional vertex (—1) connected only to the root of with an edge of 
weight J-ifi = 1) and define u{m,p) as the weighted sum of the walks of length m + 2p on U (—1), that start and 
end at and that are constrained to visit exactly p times the vertex (—1). Following the same reasoning as above one 
can compute the u(m,p)'s by recursion, according to the rules w(0, q) = 1, u(n,0) = 5 n ,o, 

u(n,q)= V u(m uPl )...u(m k ,p k )( ^~ 1 + S+ P \e[J 2 ^] . . M[J 2 ^]E[V S ] for q > 1 , 

s,pi,mi,...,pk,m k >0 
s+m 1 +2pi 4 Vm k +2p k =n 

( 21 ) 

where again in the multinomial coefficient p stands for p\-\ \-p k i an d this coefficient counts the number of orderings 

of the steps taken from the root of TW (there appears q — 1 because the first passage of the walk on i necessarily 
arrives from the ancestor). Note that in particular u(n, 1) is equal to u n , the moments of order n for the walks on T^. 



2. Numerical evaluation 



One can check that Eq. (|21[) does indeed provide a recursive scheme to compute all the u(n, q) : the computation 
of these values (and of the moment u n from Eq. (|20[) ) at rank n only requires the knowledge of u(m,p) with m < n 
and 2p < n. The number of terms to sum in order to obtain a new u(n,q) grows as 0(n 2k ) (0(n 2k ~ 1 ) in the off- 
diagonal case), and so the computation of u n requires a time that grows as 0{n 2k+2 ) (0(n 2k+1 ) in the off-diagonal 
case). Though this is a polynomial time complexity the exponent is high (at least 5), and in practice only rather 
limited values of n are accessible within a reasonable time on present computers. For illustration we present in Table U 
numerical results up to n = 62, in the off-diagonal disorder model with k = 2, along with the corresponding values 
for the pure case. 

A lowerbound on u n can also be obtained numerically for larger values of n: as all terms in Eq. (|21[) are positive, 
the quantities u'(n, q) obtained according to the recursion (1211) in which all sums on pi are restricted to Pi < p max , a 
threshold fixed independently of n, are smaller than the u(n, q)'s. The complexity of the computation of u'(n,q) is 
reduced to 0(n k+1 ) (0(n k ) for the off-diagonal case); for k = 2 we could reach values of n of the order of 200, using 
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n 


2 


4 


8 


16 


24 


32 


40 


50 


62 


pure 

disordered 


2 

0.67 


8 

1.06 


224 
5.37 


3.66 10 5 
375 


8.52 10 8 
4.67 10 4 


2.32 10 12 
7.93 10 6 


6.88 10 15 
1.66 10° 


1.63 10 20 

1.64 10 12 


3.12 10 25 
8.29 10 15 



TABLE I: Average value of the moments c n — u(n, 1) on the 2-ary tree Tfc = 2, in the off-diagonal disordered model with Jy 
uniformly random on [—1, 1]. The results of the pure case (Jy = 1) can be directly computed from Eq. ([25}. 



Pmax = 10. The approximation coming from the finite value of p maK becomes worse and worse as n grows larger. In 
any case, the values of n reachable numerically remains very far from the regime in which we expect the asymptotic 
scaling of the moments presented in Sec. Ill El to hold. 



C. Explicit expression 



By "unfolding" the recursive equations (|2QI2ip . i.e. replacing the u(n,q) in the r.h.s. by their recursive expressions, 
one obtains a formula for the moment u n in which the summation over the number of visits of all edges and self-bonds 
of the tree is made explicit. In order to simplify the notations let us define a labelling on the edges and vertices of 
the tree by: the root has label 0, the children of a vertex v have indices v\,...,Vk (except the root whose neighbors 
are denoted 1, . . . , k + 1) and an edge that connects a vertex v to its ancestor v' has index v. With these definitions 
and the previously introduced notations, we obtain the following expression for u n : 

u n = Y (^+n 1+ n 2 + ... + n k+ A „ fn v 1 + s v + n Vl + ■ ■ ■ + n Vk \ 2 

i ^ V s 0l n 1 ,n 2 ,...,n k+1 J f,„ n V n v - l,s v ,n Vl , . . . ,n Vk ) 

{s„>0}„ ev 

where the prime on the sum denotes the constraint sq + (2?i w + s v ) — n that has to be verified by the summands. 

v£V\0 

We also used the convention ( j g ^" n J = 6 Sj oS nit o . . . S nk ^, which arises from the boundary condition u(n, 0) = S n fi. 
More explicitly, this convention for the multinomial coefficient enforces the connected character of the set of non-zero 
elements n v , which in consequence form a valid skeleton for the walks (incidentally this also implies that the number 
of non-zero terms in the sum is finite for any finite n). This expression can be thus interpreted as a sum over skeletons 
and number of self-bond steps of a walk, the product of the multinomial coefficients counting the number of walks 
compatible with such values of {n e } e6 £, {s v } v< zv, that arises from the freedom of choice in the order the steps around 
each vertex are taken. Let us remark finally that this expression can easily be generalized to an arbitrary tree, at the 
price of slightly more cumbersome notations. 



D. The moments of the pure case 



As mentioned above the density of states is given in the pure case (Jy = 1, Vi = 0) by the Kesten-McKay law 
whose density was recalled in Eq. As a matter of fact this measure was determined by McKay [39| via the 

computation of its moments. Let us briefly recall this derivation, that we shall extend in Sec. IIV Al and fVAl to obtain 
the lowerbounds on the moments in the disordered case. We denote c° and the values of the moments on and 
Tfc respectively for this pure case, as well as u°(n, q) for the solution of Eq. (|2ip . 

Consider a closed walk on of length n, uj = (0 — io, i±, . . . , i n _i, i n — 0), and call depth of a vertex the distance 
that separates it from the root. Then the sequence (0 = do, d\, . . . , d n -i, d n = 0) of the depths of the vertices visited 
by to forms a Dyck path, i.e. they are non-negative integers, |d»+i — di\ = 1, with as final and initial values. This 
correspondence is illustrated in Fig. [5J 

The computation of the number of Dyck paths of length n is a classical exercise in combinatorics (Hfjl . [5l| , which 
yields the Catalan number („™ 2 ) nTI+T ^ or n even i otherwise. Coming back to the computation of c„, the important 

point to realize is that there are exactly k n ^ 2 walks on T/- that give the same Dyck path of length n: at every of the 
n/2 steps taken away from the root there are k possible children to choose from. Hence we obtain in the pure case 



for n even, zero otherwise. 
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FIG. 5: A walk on the 2-ary tree T2 and the corresponding Dyck path. In the left figure the numbers give the order in which 
the edges are crossed from top to bottom. 

A similar computation can also be done on T^; the depth projection of a closed walk of length n on Tfc still yields 
a Dyck path of length n. However the number of distinct walks corresponding to the same Dyck paths depends on 
the number of its returns to the origin. Indeed there are k + 1 choices for a step out of the root, and only k for steps 
taken away from the root from another vertex. The number of Dyck paths of length n with i returns to the origin 
can also be enumerated [5(| , the result being ("7^) ^37 ■ This yields the moment 

n/2 . . 

^(Vj^+i)'*"' 2 -'. <*) 

which can be checked to be the n-th moment (for n even) of the Kesten-McKay distribution of Eq. ([3]). A crude 
estimation of the asymptotic behavior of c° shows that lim ^ In c° = ln(2-\/fc), which reflects the fact that the support 
of the distribution in Eq. (j3|) is 2y/k (recall the discussion in Sec. IIIDI) . The same asymptotic behavior is readily 
found for from Eq. ([23]) . 

In the pure case one can actually find the solution of the recursive equation (|2 1 [) by similar considerations on the 
depth projections of closed walks, 

\n/2 + qj n + q 

for n even, zero otherwise. We show in Appendix [D] an explicit verification of this statement. 
Let us make a final series of remarks before closing this section: 

• the projection from a walk on a tree to the Dyck path representing the distance from the root to the walker 
is a powerful tool in the pure case because it is easy to count the number of walks corresponding to a given 
Dyck path. Unfortunately it is much harder to compute the total weight of walks sharing a Dyck path in the 
(off-diagonal) disorder case: the disorder is kept fixed along the walk, this induces correlations between the 
choices of the branch followed by the walk which are lost in the projection on its depth. 

• one can try to apply a saddle-point estimate to the recursion equation (|21|) : writing u(n, q = nx) ~ e. n f( x ) gives 
a functional equation on f(x) whose solution is found to be, independently of the disorder, the one of the pure 
case f{x) = Ink + (2 + x) ln(2 +x) — (1 + x) ln(l + a;). However, we were not able to get the non-exponential 
corrections to this formula in the disordered case, as the difference in (|2ip between the pure and the disordered 
case is only in polynomials factors, whereas according to III El we expect a slower dominant change in u(n,q). 
Another difficulty is that the corrections to u(n,q) do not depend much on the particular shape of disorder, 
whereas (|21[) seems to. An alternative approach that we did not pursue would be to seek a specific distribution 
of the disorder such that its moments make Eq. (I21[) solvable with a closed combinatorial form for u{n, q). 

• the asymptotic behavior of the "shifted pure case" where J,j = 1, Vi — W a fixed constant, for which the 
moments are denoted <i°, can be obtained either by shifting the density of Eq. <j3j> to pkm(E — W) or by 
depth projections leading to Motzkin paths (see App. [Elfor a definition) with a weight W assigned to self-bond 
(horizontal) steps. This yields limiln<i° = ln(2vfc + W), a result which shall be used for comparison in the 
discussion of the diagonal disorder model. 

IV. PROOFS FOR THE OFF-DIAGONAL DISORDER MODEL 

This section contains the proofs of the bounds stated in Eq. (fTTj) for the off-diagonal disorder case, i.e. for vertex 
random variables V, all vanishing, and random edge couplings Jij drawn in [—1, 1] with a probability distribution 
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satisfying the assumptions (OD1-OD2) of Sec. Ill Al We denote c„ the moments of the density of states, and assume 
implicitly below that n is even, as all odd moments obviously vanish in this case, the length of a closed walk on a tree 
being by definition even. 



A. Lower bound 



Let us first explain the strategy underlying the proof of the lowerbound, reminiscent of the procedure for finite- 
dimensional models sketched in the introduction. We shall indeed lowerbound the moments c n of the density of states 
by restricting the contributions of walks to those confined to a small volume around the root (at a distance smaller 
than h), under the condition that the disorder takes a large value in this volume (i.e. all | Jij | shall be greater than Jo). 
An optimal compromise will then be found between these two restrictions: taking h too small reduces too drastically 
the possible contributions to c„, while the probability of having large disorder falls off quickly if h is taken too large. 
We now turn this reasoning into an explicit derivation. 

We recall that c„ = XLeA-i 7r ( w )' wnere Mn denotes the set of closed walks (with no self-bond steps) of length n 

starting at the root of T&. The weight of a walk tt(uj) is given by E[Jl e Je ], where e runs over the edges of Tfc 
and n e (oj) is a non-negative integer equal to half the number of times the walk crosses the edge e. As all weights tt(uj) 
are positive, c n can be lowerbounded by restricting the sum to M n , the walks on Tfc, and further to M. n ,h, the walks 
on Tfc which go at a distance at most h from the root: 



> Cn = tt(w) > ^2 7r(w) 



(26) 



Let us denote £h the set of edges with endpoints at distance at most h from the root of Tfc, M the event: {\J e \ > 
Jo Ve € £h}, where Jo is an arbitrary threshold with < Jo < 1, and M the complementary event. For an arbitrary 
walk u) in Ai n ,h we have 



7r(cj) = E 



> E 



ee£ h 



M 



-ee£h 

> Jo i n\j\> M l£hl 



P[M] + E 
P[M] 



-e££h 



M 



P[M] 



(27) 



where in the last step we have used the independence of the J e on different edges and the fact that J2 e 2™e(w) = n. 
This lowerbound being independent of tu, it is now enough to control the number |At n ,fc| of walks on Tfc which remain 
at a distance smaller or equal to h from the root. Using the projection from a walk to its depth explained in Sec. lIII Dl 
one realizes that |A4 n ,/i| = k n ^ 2 m nt h, where m n ,h is the number of Dyck paths of length n and of height at most h. 
These paths can be enumerated with standard combinatorial techniques [5ll l52j : in particular we show in App. [E] 
(along with a more precise estimate of its asymptotic behavior) that m„^ obeys the inequality 



m n . h > 



h + 2 



2 cos 



(28) 



valid for all values of h and of (even) n. We can thus write 



c„> 4 i n\j\> M l£hl 



h + 2 



2Vk 



cos 



h + 2 



(29) 



or equivalently after taking logarithm, subtracting the leading order of the pure case and using the fact that \£h\ = 



fe-i 



< 



fe-i 



- In Cn - ln(2\/fc) > ln( J ) + 



k h+i 



n{k- 1) 



In 



3 (2 

> Jo] + - In t—^ 
n \ h + 2 



In cos 



h + 2 



(30) 
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From this equation we shall first show that lim — lnc„ = ln(2vfc)- Fixing h and Jo and letting n — ¥ oo in the 
inequality above yields 



1 / 7T 

lim inf — lnc n — ln(2vfc) > ln(J ) + In cos ( - 

n->cxD n V h + 2 



(31) 



Sending now J to 1 and h to infinity we obtain lim inf ^lnc n > hi(2y/k). On the other hand we obviously have 
Cn < c n , the value in the pure case where all Jij = 1, and the results of McKay [1^] recalled in Sec. IIII Dl implies that 
— lnc° — > ln(2-\/fe). From these matching lower and upper bounds it follows that linv^oo i lnc„ = ln(2\/fc), i.e. that 
the edge of the density of states is the same as in the pure case, as expected for the ergodicity reasons discussed in 
Sec. Ill Bl Note that the only assumption on the distribution of J which was necessary here is the existence of some 
Jm in > such that P[|J| > J ] > for all J with J min < J < 1. 

The more precise result stated in Eq. (1111) is obtained by taking h — > oo and Jo — > 1 in arL n-dependent way. By 
inspection of the last term in the r.h.s. of Eq. (|30l) one realizes that the best bound will be achieved by taking h as 
large as possible. The limitation on the possible range of h comes instead from the second term, in which k h+1 has 
to be small compared to n. We shall in consequence take h = [alnnj with a < 1/lnfc and Jo = 1 — 

y > is an arbitrary positive constant. In the limit n — > oo the r.h.s. of Eq. (|30j) becomes 



where 



1 



1 



2a 2 (Inn) 2 fc — 1 n 1 ~ Qlnfc 



In I 



UI > 1- 



1 



(lnn) 2 +2> 



O 



1 



(32) 



For all values of a < 1/ln/c and y > the second term is, thanks to the assumption (OD2) on the random variable 
J, negligible with respect to the first, which proves the statement of the lowerbound in Eq. (fTT|) . 

If the assumption (OD2) were violated the dominant effect at the origin of the reduction of the density of states 
near the edge would be the extremely low probability for a single \Jij \ to be close to 1, not the collective effect of all 
| Jij | in a given volume being large. In that case the form of the lowerbound in Eq. ()11|) would have to be modified 
and would depend on the precise form of the density of J around 1. The transition from the collective to the single 
edge dominated regime of the Lifshitz tail phenomenon, depending on the form of the edge strength distribution near 
its edge, has been studied in the finite-dimensional case in The methods developed in this paper, in particular for 
the upperbound proofs, do not seem to us well adapted to the single edge dominated regime, which shall not been 
discussed further here. 



B. Upper bound 

The derivation of the upperbound in Eq. (jlll) will proceed by a decomposition of the sum over the walks ui G M n 
according to the size |o"(w)| of their support, i.e. the number of edges of the tree visited at least once by ui (recall the 
definitions given in Sec. IIII A|) . Indeed, the weight of a walk can be upperbounded as follows: 

tt(w)= n E[J 2n ^]< Yl W 2 ] =E[J 2 ] |CT(w)l , (33) 

where we have used the facts that | J| < 1 and that (by definition of the support) n e (u) > 1 for e € a(uS). By our 
assumption (OD1) the variance E[J 2 ] is strictly smaller than 1, hence the weight of a walk is exponentially small 
in the size of its support. The strategy of the proof corresponds then to choose in an optimal way a (n-dependent) 
threshold on the support's size, use the bound above for walks with larger supports, and upperbound the number of 
walks with smaller supports. 

Let r m (n) be an increasing (integer) function of n to be fixed later, such that limr m (n) = oo and lim rm ^"' ) = as 
n — > oo. We define C M. n as the set of walks oj whose support has size at most r m (n). Using the obvious fact 
that 7v(uj) < 1 for any walk, we obtain 

c n < E[J 2 ] r ^\M n \M%\ + \M%\ < E[J 2 ]^(")|M„| + \M^\ . (34) 
According to the discussion of Sec. IIII A[ the number of walks in M~ can be written as 



r m (n) 

1^1= E E E 

r=l a£S r SeSr( a ) 



(35) 
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where S r is the set of supports of size r, S^(a) the set of skeletons of length n based on a support a of size r, and 
77(17) the number of walks compatible with such a skeleton. The rest of the proof will rely on the following statement 
about the maximal value n(n) of the combinatorial factor n(a): 

Let k{ti) = max{?7((j)|r £ [l,n/2],<7 £ S r ,a £ 5^(ct)}. TTien for all y > and a < (7rlnfc) 2 /2 ; it holds for n large 
enough that: 

(2Vk) n e~ < «(n) < (2%/fc)"e _Q ( 1 ^ . (36) 

It should be emphasized that the total number of walks of length n is, at the leading exponential order, (2^/k) n , 
so that the previous statement shows that an unique skeleton a with very large combinatorial factor rj(a) contributes 
to the leading exponential order to the sum over all walks. In the next subsection we explain how to obtain these 
bounds on n(n); unfortunately we were not able to derive the upperbound in Eq. (|3l>|) in a fully rigorous way, yet it 
is the result of an analytical computation that we also checked numerically with high precision. Hence we continue 
here the derivation assuming that Eq. (|36|) holds true. 

Going back to Eq. (|35|) and using the definition of k(ti) yields 

r m (n) 

\M%\<K(n) ]T ]T \S r n (a)\. (37) 

r=l a£S r 

It is easy to see that |iS£(c)| = ("^-"i" 1 ) independently of cr: this is the number of ways to choose r positive integers 
(the {n e }eecr) which sum to n/2. The numbers \S r \ of supports of size r are clearly increasing with r. Moreover 
r m (n) < n/2 for n large enough, hence the terms of the sum in Eq. (|37p can be upperbounded by their value in r m (n): 



<n{n)r m {n)( n/2 1 W-<">| . (38) 



The numbers \S r \ of supports of size r can be enumerated via their generating function F(x) = J2 r \S r \ xT - A support 
of size r being a subtree of Tfe of r edges, it is composed of a number j £ [0, k + 1] of edges around the root, along 
with j subtrees of the copy of rooted at the neighbors of the root. This implies that F(x) — (1 + xF(x)) k+1 , 
where F is the equivalent of F for T/., solution of F(x) = (1 + xF(x)) k . The asymptotic behavior of |<S r | for large 
r can be inferred from the analysis of the singularities of the generating function (see Theorem VII. 3 in 51]). An 
elementary study of the reciprocal function x(F) = (F x f k — 1)/F shows that it has a maximum equal to l/7/c, with 
7fc = 1 hence F(x) has a square-root singularity in x — l/7fe. As F(x) is known explicitly in terms of F one 

obtains easily 



F(x) = A k - B kv /1 - lk x + 0(1 - 7fe x) asx^l/7 fc , (39) 

where A k and B k are two positive constants whose precise expression will not be useful in the following. This 
singularity is then translated in terms of the |<S r | as |51j |: 

|^| = -7=7^(1 + 0(r- 1 )) . (40) 



We now choose an arbitrary x > 0, and set r m (n) — [ n^p+i J ■ With this choice for the maximal size of the 
supports in , one has 



n 



m( n fr\V m(n) i 

\r m [n) - IJ 



lnln? t 1 

= °(^ff)- (41) 



Hence the two leading orders of the inequality in (|38[) arises from n(n). Thanks to the freedom of choice of a in 
Eq. (|3"6")l we conclude that for all j3 < (7rlnfc) 2 /2 and n large enough: 

\M%\ < {2Vk) n e' ^ . (42) 
Dividing Eq. (|34|) by c° = |A^„|, the moment of the pure-case, one obtains for n large enough, 

^< £ &J^ ! » + r ^ , (43) 

P u 
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using the fact that the sub-exponential corrections to c£ are neglig ible here (recall that \ ln(c°) = ln(2AA) + 0(— )). 
In this last inequality the first term is the dominating one, and leads to 

- ln(c n ) < ln(2Vfc) - * lnlElJ 2 ]- 1 ) . (44) 
n [mny +x 

As x is constrained to be positive the (positive) constant ln(E[J 2 ] -1 ) is irrelevant for n large enough; this yields the 
upperbound stated in Eq. (ITT|) . 

Let us finally comment on the possible improvement of this upperbound. With a better estimate of \A4 n \A4~\ one 
should try to make the threshold function r m (n) grow faster with n. The limiting factor would then be the allowed 
range of (3 in the second term of Eq. (l43l) . f3 < (7rlnfc) 2 /2; this would actually be the best one can hope for, as a 
greater value of (3 would contradict the lowerbound in Eq. (fTTj) . 



C. Combinatorial factor 



We shall now present our arguments in favor of the right inequality in (|36[) . We recall that the expression of r/ (a), 
i.e. the number of walks compatible with the skeleton a, is given by the product of the multinomial coefficients in 
Eq. (|22|) . The maximal value of rj can thus be written as 



n(n) — max max max 

re[l,n/2] v£S r {n c >0} cE =„ 
2 2~2 n e —n 



ni + n 2 H h n k+1 

ni,n 2 , . . . ,n k+ i 



n 

vE<t\0 



n v - 1 



1, n Vl , . . . , n Vk 



(45) 



where we follow the labelling of introduced in Sec. IIII C( giving to an edge the index of its endpoint vertex most 
distant from the root. We also set by convention n e — if e ^ a. 

An upperbound on k(ti) is obtained by relaxing the constraint on the possible values of the variables {n e }. We 
shall thus extend their domain to non-negative reals, denoted x e to avoid confusion, using the natural extension 
T(x + 1) = x\ from integer x to arbitrary reals. We also let the set of edges with positive values of x e to be an 
arbitrary subset of £ p , which contains the edges with endpoint at distance at most p from the root. For the ease of 
notation we introduce >c(n) = lnft(n), which is thus upperbounded as 



n(ri) < max sup 

pe[l,ra/2] {x c >l} ee£p 
2Y. x c =n 



lnT(l + xi + x 2 



^ [^T(x v 



x k +i) - lnT(l + xi) lnT(l + x k+1 ) 

H h x Vk ) - ]ilT(x v ) - lnr(l +x Vl ) — lnr(l + x Vk )] 



(46) 



We expect this upperbound to become tight at the leading order when n — > oo: the integer values of n e achieving the 
maximum in Eq. (|45p will become large as well, hence relaxing them to be real numbers should have a minor effect. 

For a given p the supremum is over a smooth function of the \£ p \ reals x e ; assuming that the supremum is reached 
in the interior of its domain, one has to look for the critical points of the function. There is always one radially 
symmetric critical point, i.e. where x e depends on e only through the depth i of the edge; we denote Xi the common 
value of x e for all edges at depth i. We assume that this is the global maximum of the function. This yields 



x(n) < x*(n) = max sup K p (x\, 

pe[l,n/2] x!,...,x p >0 

2 Y2i=l v iXi—n 



(47) 



where we defined the radially symmetric function K p as 

p-i 



if p (xi, ...,%) = } Vi[hiT(xj + fcxj+i) - lnr(xj) - fclnT(l +x i+ i)] . 



(48) 



For i > 1 we denoted Vi = (k + l)k % 1 the number of edges of depth i, and we set by convention vo = 1 and xq = x"i + l. 
For a given p the critical point of K p is achieved at the solution of: 



ip(xi + kxi+i) — ip(xi) + ip(xi-x + kxi) — ip(xi + 1) = A for i £ [l,p] with xq = 1 + x\ , x p +\ = 



(49) 
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FIG. 6: Plot of the function x*(n) denned in Eq. (I47|l . the symbols are the result of the numerical evaluation of >r*(n)/n, the 
line is the conjecture (|51|l . These results are for k — 2. 



where A is a Lagrange multiplier that has to be fixed to the value enforcing the constraint ViXi = n/2, and 

■0 = d j" r is the digamma function. Note that x* is defined by extension for real values of n, and that A plays the 
role of a parameter conjugated to n. The previous equation can be rewritten as a three terms recurrence on x^: 

Xi+i = r [—Xi + ip^ 1 (X + ij){xi) + t/j(xi + 1) - ip(xi-i + kxi))] . (50) 

We solved numerically the optimization problem defined in Eq. (|47[) : the plot in Fig. [5] display the values of x*(n)/n 
we found in this way, as well as the good agreement with our conjecture on its asymptotic behavior: 

n 2{inn) z \(mn) z J 



which implies the upperbound stated in Eq. p6[) . We turn now to the computation that led us to the formula in 
Eq. (|51[) . In the large n limit most of the non-zero Xi can be expected to be large (this is also confirmed by the 
numerical computation of x*). In consequence we can simplify the recurrence relation (|50[) using the first term of 
the asymptotic expansion of the digamma function, ip(x) = ln(x) + 0(l/x). The resulting relation is more compactly 
written in terms of a; = , which is found to obey 

oti+x = f(ai, A) , with f(a, A) = ^ ( -1 + € j . (52) 

At the order of our approximation the initial condition oi\ = is equal to ai = 1. The fixed-point equation 

a = f(a, A) undergoes a bifurcation transition at a critical value A c = \n(4k) (see left panel in Fig. [7]). For A > A c 
there are two solutions which coalesce in a* = | at A c , and disappear for A < A c . This bifurcation has the following 
consequences on the solution of Eq. (|52[) with initial condition a± = 1, that for clarity we denote a^(A) to emphasize 
its dependency on the parameter A: 

• right at the transition the sequence decays towards its fixed-point as 1/i, more precisely 

a,(A c ) = \ + \\ +0 (j) ■ ( 53 ) 

• when A — > A,T a plateau develops around the avoided fixed-point a* = 1/fc, with a diverging length of order 
(A c — A) -1 / 2 , see the right panel of Fig. [7] One can establish the following scaling behavior by zooming around 
the plateau: 



lim 

A-J-A7 



\/A c — A ( 1 v*c-a ^ ^ k 



-Htan(^-|) V 0e]O,7r[ . (54) 



This in particular shows that the length of the plateau is ir/\/\ c — A at the leading order. We shall use this 
scaling behavior under the less precise but more readable form: 

a, « ^ - |\/Ac - Atan (i\/~X c - A - ^ , (55) 



where it is understood that i is in the scaling regime, i.e. of the form 6/\/X c — A with 9 g]0,7r[. 
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Note that there is a matching condition between these two regimes: if one considers a value of i very large but finite 
with respect to (A c — A) -1 / 2 , the same behavior of aj(A c ) is obtained by taking i —¥ oo in Eq. (|53[) or A — > A c with 
i fixed in Eq. (|55[) . The above behaviors are generic for all recursion of the forms cti+i = f(o>i, A) that encounters a 
bifurcation transition, and the quantitative statements only depend on the value of the partial derivatives d\f and 
d 2 a f of / in (a*, A c ). It turns out in addition that for the specific function / of Eq. (|52p the critical sequence can be 
computed exactly, Q!j(A c ) = ((fc + 1) + (k — l)i)/(k(3 — k) + k(k — 

Let us now investigate the consequence of these behaviors of a, (A) on the properties of the Xi (A). The leading 
order of the recurrence relation (|50p only constrains the ratio on between successive terms in the sequence {xi}, in 
consequence the initial condition x\ is at this point a free parameter that we shall fix afterwards. By the definition 
of on we have 



Xj(A) =xi[- 



II(^(A)) 



(56) 



j=2 



where we have factorized the plateau value 1/fc of the a,. Consider first the large i regime at the transition, i.e. for 
A = A c . As a consequence of (|55|) one has Y^)=2 ha(fc<x,(A c )) = 21n(i) + ln(Cfc) + 0(l/i), hence 

i-l 



Xi(A c ) = x\ 



i 2 C k (l + 0(l/i)) , 



(57) 



where C k is a positive fc-dependent constant. Actually one can compute C'k explicitly thanks to the exact expression 
of aj(Ac), and find C k = (k — l) 2 /(2fc(fc + 1)). Let us now turn to the scaling regime A — > \~ where Eq. (f55|) is valid. 
Consider two indices iq < i of the form zq = 9q / V A c — A and i = 6/\/\ c — A. One has 



i(A) = x io (X) 



x io (A) 



exp 



exp 



i 

-2yJ\ c - A ^2 tan (jy/\ c 



j=i + l 



(58) 



(59) 



where we have used Eq. (I55p in the second line. Now in the limit A — > A c the term in square brackets is the Riemann 
discretization of the integral of tan(# — tt/2), hence 



Xi(X) « x io (A)fc 



to — 1 



9o) Vfc 



sin 2 (6») 



(60) 



Finally we invoke a matching argument as explained above : we take io large but finite with respect to (A — A c ) 1 / 2 , 
so that #o — > 0, and we replace the value of Xi by its value computed at A c in Eq. (|57p . This yields 



Xi(X) 



xiCk ( 1 



A r — A \ k 



i-l 



sin 2 (iy/X c - A) 



(61) 
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e = tVx^J 



FIG. 8: Verification of Eq. (|6ip for k — 2, A = A c — 4.10~ 5 . The solid line is the analytical prediction, the symbols are the 
Xi obtained from the resolution of the optimization problem of Eq. (I47|l . and are indistinguishable on this scale from their 
approximate computation in terms of the a^. Note that there is no fitting parameter in this figure. 



valid in the scaling regime i = 6 / \J\ C — A with 8 g]0,7t[. In Fig. [8] we present a numerical confirmation of this 
prediction. 

As can be seen on the right panel of Fig. once the the sequence at exits the scaling regime around the plateau 
a finite number of additional iterations brings it to negative values. Let us call p(X) the largest value of i such that 
ai(\) is positive. From the observation above, p(X) — ir/\/ A c — A at the leading order when A — > X~ . Let us now 
discuss the scaling of x\. For the computation above to be consistent with the replacement of tp(x) by ln(x) it was 
based upon, one should have Xi(X) large for most of the i's between 1 and p(X). On the other hand one should impose 
in an approximated way the boundary condition x p ^ + i = of the exact system (|49p . In consequence we shall choose 
xi such that Xi is of order one for i at the end of the scaling regime (i = ir~ jy/ A c — A), hence at the leading order 
(within multiplicative constant and sub-exponential corrections in (A c — A) -1 / 2 ) 

(Trlnfc) 2 
(In x\f 



i 1 (A)«fcvra j L e. A c - A ~ \_ 5 > . (62) 



This ends our determination of the asymptotic form of the optimal value of the sequence {sci}fl^ as a function of A; 
from it we shall now compute the corresponding values of n and . The former reads 

' ^ ' - - ' k + 1 C kXl {X) 

(63) 



i(A) = 2{k + 1) £) k%{X) » 2 fc + 1 ^i)^ [ d9 sin2 (^) = - 



k (A c - A) 3 / 2 



where we replaced the sum over i by an integral over 9 using the form (I6ip of the Xi in the scaling regime. The leading 
behavior of n is thus dictated by the one of Xi(X), and the relation (|6"2"]) can thus be re-expressed as 

(^lnfc) 2 
(inn) z 

To compute the value of >r* achieved in the large n (or equivalently A — > A~ ) limit we use the asymptotic expansion 
lnT(a;) = xlnx — x + O(lnx) (consistently with our approximation of ip(x)), the term in square brackets in (|48[) 
reading 

Xi[ln(4k)+p i+1 ln(2k) + (2 + i+1 )ln(l + ^] - (1 + ft+i) ln(l + + 0(1x1%) , (65) 

where we defined /Jj by an = r(l + A)- Once inserted in the sum over i the first term leads to a factor proportional 
to u, 

x«(n) « nln(2Vfc) + ln(2fc) + (2 + /3 i+1 ) In M + £±i) - (1 + In(l + A+i)] • (66) 

In the scaling regime /3j = — 2\/A c — Atan(i-^A C — A — ^) is small, we thus expand the previous expression to second 
order in /?, and trade the sum with an integral to obtain 

fc+1 Xi(A)Cfc r Jfl „ : „2^T o^or.vA x.„„^ M M XN2.„„2^ ^ 

(67) 



«,(») « nln(2Vk) + ± ^ ^3% y d0 sin 2 (0) [-2 ln(2fc) v/A~Atan (fl - ~) - (A c - A) 2 tan 2 (o - ~) 
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The integral of the first term vanishes for symmetry reasons, the second one is easily evaluated, and by comparison 
with the expression of n given in (1631) one obtains 

«ln(2V3fc)-J(Ac-A) ■ (68) 
n 2 

With the expression of A c — A as a function of n given in this completes the derivation of (|51j) . 

Finally the lowerbound in (|36j) can easily be justified by contradiction: suppose there exists x > such that 
for arbitrarily large n, x(n) < (2\fk) n e ( |n »)^* . Then one could repeat the derivation of Sec. IIVB1 choosing 
r m (n) = \ — — " 2 -§ J i an d obtain instead of (|44|) the upperbound 

- ln(c„) < ln(2Vfc) - * ^(EfJ 2 ]- 1 ) . (69) 
This would violate the lowerbound in (fTTj) , hence the contradiction. 



V. PROOFS FOR THE DIAGONAL DISORDER MODEL 



This section contains the proofs of the bounds stated in Eq. (fT2j) for the moments d n of the diagonal disorder 
case, i.e. for edge couplings deterministically equal to 1 and vertex random variables Vi drawn in [0, W] with a 
probability distribution satisfying the assumptions (D1-D2) of Sec. Ill Al The proofs are similar to the off-diagonal 
case, we mostly precise the additional technicalities that arises here. 



A. Lower bound 



As in the off-diagonal case the lower bound on d n follows by considering only the walks restricted to a neighborhood 
of depth h around the root of the tree, and by conditioning on the event that the disorder is large in this neighborhood. 
We recall that d n = X^ew 7r ( u; )' where W n denotes now the set of closed walks on Tfc of length n with self-bond 

steps allowed, and the weight of a walk is it{oS) = lEUIuev ]) with s v (uj) the number of self-bond steps taken by 

the walk around the vertex v of the tree. These weights being positive we can lowerbound d n as 

dn > ' ( 70 ) 

where W n ,h is the set of walks on that visit vertices at distance at most h from the root. We call Vh the set of such 
vertices, introduce a positive threshold Vo < W, and define the event M as {V v > Vo Vi; € Vh}- With a reasoning 
similar to the one which yielded Eq. (|27fl we obtain 

7T(W) > P[V > ^ ]' Vhl ( 71 ) 

for all walks w £ W n ,h, where we defined Stot(w) = J2 V s v( UJ ) the total number of self-bond steps taken by the walk. 
Putting these two inequalities together one has 

dn > F[V > V a ] lVhl V Statiul} . (72) 

We now use the projection from a walk to its depth first introduced in Sec. IIII Dl The self-bond steps are then 
associated to horizontal steps in the path, which is called a Motzkin path in this case. The sum over uj in the last 
equation is thus equal to the weighted sum over Motzkin path of length n, where ascending steps have weight k (the 
number of branches the walk can choose from in a step away from the root) and horizontal (resp. descending) steps 
have weight Vo (resp. 1 ). This quantity m n ,h is computed by combinatorial techniques in App. |EJ The number of 
vertices at depth at most h is \Vh\ = \_i > hence 

1 1 k h+1 

-lnd n >-lnm n , h + — -ln¥[V >V ] . (73) 

Tl Tl fX\ rZ J- J 
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If we let n —> oo with h and Vq fixed, the asymptotic properties of m n ,h proved in App. [E] (see Eq. (|E8|I ) yields 

liminf-lndn > In f U + 2\/fccos ( - ) ) . (74) 
rn-oo rt \ \h + 2 / / 

Letting finally h oo and Vb — > IU and noting that d n < d n , the shifted pure model with = 1 and Vi — W for 
which lim ^ lnd° = ln^V^ + W) allows to conclude lim — lnd„ = ln(2\/fc + W). The upper limit of the support is 
thus + W in the diagonal disorder case, in agreement with the ergodicity arguments sketched in Sec. Ill Bl 

We now consider the limit n — > oo, taking the parameters h — > oo and Vb — > W in an n-dependent way. For the 
same reasons as in the off-diagonal case the optimal scaling of the parameters is h — [a In n\ with a < 1/ In k and 
Vq = W — (j n ^yi+ v where y > is an arbitrary positive constant. Using the asymptotic expansion (|E9I) for m ni h, the 
r.h.s. of ([73| becomes 

+ o(—^—) . (75) 
\(\nn) 2 +y ) v ; 

The assumption (D2) on the random variable V ensures that the third term is negligible for any a < 1/lnfc and 
y > 0, which concludes the proof of the lower bound in Eq. ([12]). 



\n(2Vk + W) 



■wWk 1 
a 2 (2Vk + W) (Inn) 2 



k — In 



l — a In k 



In I 



V > W - 



(Inn 



2+y 



B. Upper bound 



We present now the proof of the upper bound in Eq. (|12|) . Let us first recall the definition of the moment of the 
shifted pure case (Jjy =l,Vi = W) in terms of the weights 7r°(w) of the walks: 



< = e 7r °( cj ) ' n ° M = n w/s,,(w) = w tot( " ) 



(76) 



d n being equal to (2vfc + W) n within polynomial corrections. Consider now the effect of the disorder on the weight 
of a walk: 



tt(w) - Y[ E[U S ^ } ] = tt°(w) JJ E 



< 7T°H 



V_ 

w 



r(u>) 



(77) 



where t(w) = |{v € V|s„(c*;) > 1} is the size of the self-support of the walk, as defined in Sec. lIII Al By our assumption 
(Dl) E[V/W] is strictly smaller than 1, hence the weight of a walk is exponentially suppressed in the size of its self- 
support. The proof will thus be based on partitioning the set of walks in W n between those with a large self-support, 
for which we shall use the bound above, and on bounding the contribution of the walks with small self-supports. 

Let us introduce a growing integer function i m (n), with t m (n) — > oo and * m (") — >. as n — > oo, and denote the 
subset of W n which contains the walks whose self-support contains less than t m (n) vertices, = {uj £ yV„|r(w) < 
t m (n)}. Then 



d n < E 



V 

w 



1\ tmM 



E *V) + E A**) < 



E 



V 

w 



tm(n) 



E A* 



(78) 



where we have used the obvious facts that tt(uj) < 7r°(w) and tt°(uj) > for all uj. 

We shall further partition according to the size of the supports of the walks (i.e. the number of edges which 
are crossed at least once by the walk). Let us introduce another growing threshold function r m (n) with r m (n) — > oo 
and ^ o, and define W n = {uj £ W|||er(w)| < r m (n)} and W n = {uj £ W£\\ct(uj)\ > r m (n)}. We can thus refine 

the bound above as 



(e 


~v~ 






) 




w _ 





tm(n) 



E 



E 



7r°H . 



(79) 



We shall work out separately upperbounds for the contributions of and W 2 . Let us start with the former, and 
introduce a larger set of walks = {ui £ W n ||cr(a;)| < r ro (n)}, in which we have removed the constraint on the 
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size of the self-supports. By the positivity of the weights the sum over dominates the one on W*. The crucial 
point is now to realize that there is a (™) to 1 mapping between each walk uj G W n with s tot (w) = s and the walk 
ZJ € M. n -s obtained by removing all self-bond steps, that in consequence shares the same skeleton. This binomial 
coefficient counts the number of possible choices for the s times < t\ < ■ ■ ■ < t s < n — s at which a self-bond step is 
inserted in ZJ to construct to. For completeness we give in App. |F]a more formal proof of this fact based on the study 
of the combinatorial factors. We can thus write 

£ *>) < £ vrV) = f; ( n )w\Mt.\ , (80) 

where M.~_ a is the set of walks of length n — s, without self-bond steps, studied in Sec. lIVBl We shall thus exploit the 
bound on \A4^\ derived in that section: we choose an arbitrary x > 0, (3 < (lnfc) 2 7r 2 /2 and set r m (n) — l_ ( lnT ^!i+°J • 
From Eq. (|42p we obtain the existence of an (even) no such that, for n — s > no, 

\M^_ 8 \ < (2Vfc)"~ s e" /3 <M^V < (2Vk) n ~ s e~ P T^ ; ( 81 ) 

this bound being trivially true for odd values of n — s for which |./V^_ S | = 0. Inserting this bound in the inequality 
above yields 



^/H< ^ r r(2Vfc)"V^+ ]T r)w s \Mt s \ (82) 

^ E ( n )w s (2VkT- s e~?^ + \Ml\j2( n )w s (83) 

c— n V / c— n V / 



s=0 x 7 s=0 



o^(2Vk + WeT'o^) n + \M^\{l + W) n . (84) 

From the first to the second line we have used the fact that \A4^\ grows with (even) n and extended the range of the 
sums, all terms being non-negative. As uq is now fixed and 2y/k > 1 the second term is exponentially smaller than 
the first one in the large n limit. Moreover the first one can be expanded as 



e f3 ^W(2^/k + We 13 ^ 1 ) 71 = e P {2y/k + W) n (l + 2 J^ +W (e^^ 1 - lj) 



(85) 



= { 2Vk + Wre^^^ +0(n/{lnnr) . (86) 
We can thus conclude the study of the sum over Wl, : for all 7 < ^ 7rl " fc - ) 7=^— it holds for n large enough that 

-L £ ttV) < e^<&? . (87) 

Let us finally derive an upperbound on the contributions of the walks in the set W 2 whose support contains more 
than r m (n) edges while their self-support contains less than i m (n) vertices. We write this sum as 

n-2r m (n)-2 n/2 

E 7r °( w )= E ^ s E E E ms,A), m 

ue\V% s=0 r=r m (n)+l WJM^, .4Ccr(ZJ)U0 

|-A|<tm(n) 

where is the set of walks of length n without self-bond steps with a support of r edges, and E(ZJ, s,A) is the 
number of walks U) G W n that are obtained from ZJ G Ad n — S by adding to it s self-bond steps, in such a way that 
the self-support of uj is a given A, a subset of the vertices visited by ZJ. A short combinatorial reasoning presented in 
App. [F] yields the following upperbound for this quantity 

^,.^)<f B - 2 "^l + ^ + 1 ^V (89) 
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In all the terms of (|88|) we have |c(u;)| > r m (n) + 1 and \A\ < t m (n); moreover the number of terms in the sum over 
A can be coarsely upperbounded by t m (n)( t "„))■ This being done one can relax the constraint over r and sum over 

all walks 57 £ .M n _ s , which are no more numerous than (2v / fc)"" s . This yields 

<n —2r (n) — 2 

' n \ /n-2r m (n)-2 + (fc + l)t ro (n) 



^,»< V W s (2Vkr-°t m (n)( ' * ' m W-^^^WM. (90) 



We choose now the threshold function on the self-support size as t m (n) — [ ^ ^2+2* J > which is in consequence negligible 
with respect to r m (n) as n — > oo. In order to perform the sum over s we decompose the last binomial coefficient as 

n — 2r m (n) -2 + (k + l)t m (n)\ /n - 2r m (n) - 2\ (fc+ ^ m(n) n _ 2r m (n) - 2 + i 



s / \ s 



n-2r m (n)-2- s + i 1 ' 

fn - 2r m (n) 2\ 

1 s ) {{k + l)f m (n))! ' 1 J 



This yields 

2r m (n)+2 

^n)^ ((fc+ l)t m (n))!^ v ' " ; \2y/k + W 



V 7r°(") < *„(„) f " -g^^l- (2Vfc + W)» f ^ ) . (93) 

KtrriMj ((k+ i)tm(ri))\ \ 7\/k. + W I 



Using standard bounds on factorials and binomial coefficients and the fact that t m <C r m , we can conclude on the 
existence of /z > such that for n large enough 

-L J] A")^-"*^ . (94) 
Finally, putting together (|79|) . (|87|) and ((94]) , we obtain for n large enough: 





'V 






) 




W_ 





The first term dominates when n — > oo, hence we conclude that (renaming 2x in x) for all a; > and n large enough : 

- \n{d n ) < \n(2Vk + W)- .. \ 2 . HEIV/W}' 1 ) . (96) 

The same remarks as those following (l44l) hold, namely that the positive constant ln(E(V/IU] _1 ) is irrelevant because 
of the condition x > 0, and that an improvement of the proof would be in any case limited by the requirement 

7 < ^ 1 2 ^ 2V^+w 1 * n a g reernent wrtn the lower bound in Eq. ([12)) . 



VI. CONCLUSIONS 



In this paper we have characterized the double-exponential Lifshitz behavior of the Bethe lattice density of states 
close to its edge in presence of bounded (diagonal or off-diagonal) disorder in an indirect way, by controlling the 
asymptotic growth of its moments. As a byproduct of this study we have unveiled some geometric properties of the 
dominant supports of closed random walks on regular trees. Let us conclude by making a series of observations and 
suggestions for future work. 

We believe that most of the methods and results of the paper could be adapted to characterize the integrated 
density of states of the adjacency matrix of random graphs with arbitrary bounded degree distribution. Denoting 
kma,x the maximum degree, one expects a Lifshitz tail phenomenon to occur around 2^fk max as soon as the random 
graph is not fc max -regular. To characterize it one could use the results of the off-diagonal disorder model on Tfc max _i, 
with Jij £ {0, 1}. The J^- are i.i.d. only in the case of percolation (i.e. for a binomial degree distribution), but in the 
general case two are correlated only when they share a common vertex, so that it should be possible to handle 
these weak correlations, thus extending the results of [35l[36|. 
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FIG. 9: The density of states in presence of off-diagonal disorder for different degrees, with rescaled couplings Jij uniformly 
random on [~* s /2/k, \j2/k\. The k — oo curve corresponds to the Wigner semi-circle law of support [— \/8/3, ^/8/3]. 

When looking at the plot of the off-diagonal disorder density of states in Fig. Q] one could be tempted to divide its 
support in two regions, one where it is "large" and one where the Lifshitz phenomenon strongly reduces it. However 
the density of states is strictly positive everywhere inside its support, so that there is no unambiguous criterion on 
the location of the frontier between these two regimes. Such a distinction can be sharply defined by letting another 
parameter of the model evolve, i.e. the degree ft + 1 of the lattice. In Fig. [9] we have plotted the density of states 
of the off-diagonal disordered model for various values of ft; to allow for a meaningful comparison of these curves we 
have scaled in accordance the magnitude of the disorder, taken to be uniform on [— -\/2/fc, ^Jljk\. The support of 
the density of states is thus, independently of ft, the interval [—2-^/2,2-^/2]. However the density of states converges 
(pointwise) in this limit to the Wigner semi-circle law whose support is controlled by the variance of the Jy (and 
not by their maximal value as the support of the density of states) , and which is found here to be [— -y/8/3, \/8/3] . 
The regime \E\ G [-\/8/3, 2\/2] is thus, in this ft -> oo limit, the one of the Lifshitz tail. Note the similarity with the 
unbounded (Gaussian or Cauchy) diagonal disorder case studied in [l6|, [2l], [2(| [53] , with 1/fc playing here the same 
role as the multiplicative constant in front of the unbounded V^'s in these studies. 

A possible direction for future studies would be the investigation of the distribution of the largest eigenvalue of 
the Anderson model defined on random regular graphs, more precisely of its deviation from the upper limit of the 
density of states. In the Gaussian ensembles of random matrix theory the density of states is the Wigner semi-circle 
law, which vanishes as a square root at its edge; in that case the typical fluctuations of the largest eigenvalue around 
the edge of the density of states are of order TV" 2 / 3 and described by the Tracy- Widom la w f53l. This result has 
been extended by Sodin to the case of random regular graphs of fixed degree with = ±1 [54L l55|. In presence of 
a Lifhsitz tail in the density of states both the scaling with N and the distribution of the fluctuations of the largest 
eigenvalue should be modified. 

Finally, another direction of investigation could concern the numerical procedures used to solve Recursive Distri- 
butional Equations as |8]), The sample representation at the basis of the population dynamics [l2l,[4c| is very natural 
and allows simple and versatile implementations of the method, yet it performs poorly in sampling rare events of the 
Lifshitz tail type. A combination between a sample representation of the typical part of the distribution and a mesh 
representation for its very small probability part might provide a better alternative in such cases. 
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Appendix A: Numerical determination of the density of states and the center of the band singularity 



Let us first explain the numerical procedure we followed to obtain the density of states displayed in Figs. Ill3l and 
[5J As shown in Sec. Ill Bl this computation amounts to solve the Recursive Distributional Equation (RDE) of Eq. ©, 
and then to compute the average given in (JSJ . The numerical procedure we implemented to solve this problem, known 
as population dynamics [IH, |4(| or pool method [l8| , consists in approximating the distribution of G by the empirical 
distribution over a sample of N ^> 1 (in practice we used N of the order of 10 4 ) representative values of G. This 
sample is updated according to Eq. ([8} until numerical convergence is reached, then the average in (|9]) is estimated as 
an empirical average over the sample (and over several update steps, a few hundreds for the data shown, to increase 
the precision). The numerical accuracy of this method is controlled by the size M of the sample. The limit on the 
accessible values of N put by the memory available on present computers is such that the regime of Lifshitz tails, 
where the density of states is extremely small and hence requires a huge precision to be determined, is not reachable 
via such a numerical procedure. 

We shall now present the heuristic arguments which led us to the conjecture that in the case of continuous off- 
diagonal disorder with support intersecting a neighbourhood of 0, the density of states exhibit a singularity at E = 
of the form 

p{E) ~p(0) - a\E\^ asE^O, (Al) 

where a is a positive constant. This conjecture is in very good agreement with our numerical results for k = 2,3,4, 
as shown explicitly for k = 2 in the right panel of Fig. [3J 

Consider the RDE (JSJ on G at the center of the band (E = 0), and in the limit of vanishing regularizing imaginary 
part r\ (a similar limit of the RDE has been studied rigorously in with Ji = 1 and k a Poisson random variable). 
To simplify notations we introduce the random variable g = —iG(0), solution of the RDE 

S^-r 1 — • (A2) 
Ei=i Ji9i 

Since for Im z > the relevant solution of the RDE (JHJ) has Im G(z) > 0, we shall look for a solution of (|A2j) supported 
on [0, oo [. We will assume that g has a density JI that decays as a power law for large values of g, 

g- x ~\ (A3) 

where in this appendix ~ means asymptotic equivalence within a multiplicative constant, and x is an exponent that 
we shall determine self-consistently. First note that (|A3[) is equivalent to 

P(g>A) ~ -1. (A4) 

A— ¥00 A 

The recursive distributional equation ((SJ) relates the assumption on the behavior of Ji near oo to its behavior near 0: 

V(g<a)=v(j2 J ?9 i > ^ 

2 o>M (A5) 



- f\Jg>- 

a-J-0 \ a 



- P f g > - 1 - « 



In the first step we have used the heavy tail character of <?, hence the probability that the sum is large is close to the 
probability that a single term is large, the second step relies on the boundedness of J, and the last one follows from 
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4j. Now we use again §S§ in the other direction, i.e. to deduce the behavior at large g from the one at small g: 



\9>A)=P[J2j!g l <- 



A— >oo 



A— >c 



J < 




(A6) 



A 

Indeed the probability that the sum in the first line is small is roughly the probability that all the (positive) terms 
are small. In the second step we assumed x > 1/2, hence the probability that the product J 2 g is small is controlled 
by the smallness of J 2 , and the last step follows from the assumption that J has a finite density near (note that if 
the support of J were bounded away from there would not be any singularity in the density of states) . Comparing 
this with (|A4I) gives the self-consistency condition x = fc/2, which confirms the hypothesis x > 1/2 for all relevant k. 
The same computations applied to the density /x of g = —iG(0) yield the expansion: 



9 



a g 



{a' e K+) 



(A7) 



Now we reintroduce the regularizing imaginary part 77, and observe that when it is small its main effect is to provide 
a cutoff at I/77 on the distribution of ~g and g determined above directly at 77 = 0. This yields 



-JmE[G(0 + ir])} ~ - [ V gfx{g)dg 



1 f°° Oc f°° hi k 1 

- / 9V{g)&9 / .9~^ (1 + o(l)) dg = p{0) -a 77^ + o 

Jo 7T J 1/n 



(A8) 



The finiteness of p(0) follows from the integrability of gp{g), and a is a positive constant proportional to a'. If we 
assume finally that we can make the substitution 77 — > —iE in this expression, and that it corresponds to rj^~ — > 
\r]\^~ — !• I-El - ^, we get: 



p(E) = p(0) - a\E\— + (E~) , 



(A9) 



a form in perfect agreement with the numerical determination of the density of states for k — 2,3 and 4 (shown in 
Fig. [3] for k = 2). The justification of the jump from (|A8[> to (|A9[) is not obvious; indeed, the small r\ expansion shows 
that p is not analytic near 0, hence no 'analytic continuation' argument can be straightforwardly applied. However 
as the density of states has to be invariant under E + ir\ — > —E + ir/, one could assume for Imz > an Ansatz 
p{z) = p(0) + fi(\z\) + f 2 (z) with / 1>2 regular and f 2 = o(f\) when \z\ -> 0. 



Appendix B: From the moments of a random variable to its cumulative distribution function 

In Sec. Ill Dl we have seen how the asymptotic behavior of a probability density close to the edge of its support 
translates into the behavior of its large order moments. This appendix is devoted to the reverse direction of this 
connection, namely to what can be learnt on a probability measure from the asymptotic knowledge of its moments. 
In particular we provide the proofs of Eqs. (I14I15[) . 

Consider a non-negative random variable X, with a probability distribution 77 supported on [0, E + ], where E + 
is a priori unknown. Note that the assumption that 77 is supported on non-negative reals is not restrictive for the 
cases we want to consider here: in the off-diagonal disorder case one can use the symmetry of p to come back to the 
above situation, and in the diagonal case the density of states is supported on [£L, E + ] with \E-\ < \E+\, hence the 
negatively supported part of the probability measure is asymptotically irrelevant. Thus Eqs. (|14I15[) will be direct 
consequences of (I11I12P and (|B7I) . 

Let us decompose the expression of the n-th moment of X, for an arbitrary S £ [0, £*+], as: 

u n = E[X n ] = E[X n \X <E+- S]P[X <E + -5}+ E[X n \X > E+ - 6}F[X > E+ - 5] . (Bl) 
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From that equation one easily obtains: 

(1 - N(E+ - Sj) (E+ - S) n <u n < (E+ - 8) n +El(l- N(E+ - 5)) . (B2) 
Hence one has, for arbitrary n: 

1 - N(E+ -S)< (E+ - Sy n u n , (B3) 

(8 \ n 
1 - -g~ J ■ (B4) 

As this is valid for any 8 > 0, this proves our first point about the link between the support of the density of states 
and the exponential growth of its moments: 

ln-EIf = lim — In it„ ; (B5) 

n— >-oo fi 

a similar argument can be found in (39j . 

In order to obtain some bounds on the behaviour of N(E + — 8) one has to choose the value of n in Eqs. (IB3IB4|) 
in an optimal way, for a given value of 8. As our control on u n is limited to large values of n, the bounds on the 
cumulative distribution function shall be relevant only for small values of 8. The result will of course depend on the 
precise form of «„, or more precisely on its large n asymptotic. In the following we will assume that u n has the large 
n behavior of the form (|11I12[) of central interest here; the reasoning is however more general and can be applied to 
reconstruct the behavior of N from u n in the three examples of Sec. Ill Dl We shall thus make the following assumption 
on the large n behaviour of u n in order to treat within the same frame the off-diagonal and diagonal disorder cases: 

Assume that for some c, e, x > and hq G N it holds for n > uq that 

-(l + e)E + -^ < -\nu n -U(E+) < -E+ C . (B6) 
Then for 8 > small enough one has: 



2 + 'J:r 



e" e " < 1 - N(E+ - 8) < e~ e x ' . (B7) 

We start with the right inequality. Using the upperbound on u n from (|B6p . (|B3[) reads for n > n : 

1 - N{E+ -8)<(l- e ~ E+c 0^* . (B8) 



Now we take: 



2~7 



n= [e\ 6 J J +1 . (B9) 
For 8 small enough n becomes larger than no and one gets, replacing into (|B8p : 

l-N(E+-8) <e- e ^ ' . (BIO) 



The left inequality can be obtained in a similar way: this time (|B4|) may be rewritten, under assumption (|B6[) and 
for n large enough: 



1 - N{E+ -S)> e - {1+t)E+c ^ A-J . (Bll) 

To obtain a valid bound we want the first term to be much larger than the second, that is: 

^ -(l+e)B+c_L^ ^ 8 

Thus we take 

n=[eV^T J (B13) 
which satisfies (|B12|) and gives, replacing into (|B11[) : 



e > ( i _ ^_ j . (B12) 



1-N(E+-S) >e" eV . (B14) 
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Appendix C: Proof of the recursion formula for the moments 



In this appendix we give a formal proof of Eqs. (120I21[) . making use of generating functions; we follow the standard 
notations in this field, namely if F(z,x) is a formal series, [z n x q ]F(z, x) denotes the coefficient of its z n x q term. 

Let us first introduce, for a given realization of the disorder, the generating function F{z) — (0| |0), which is 
equal up to a change of variables to the resolvent at the root of Tfc. We thus have u n = E[[z n ]F(z)]. We decompose 
the operator H according to 



fc+i 



fc+i 



h = v\o){o\ + (K)<o| + |0)(i|) + ^2 Hi, 



(CI) 



where Hi acts only on the subtree rooted at i, one of the k + 1 neighbors of the root. We introduce the generating 
functions for each of these subtrees, Fi(z) = (0| T ~. |0), and use the resolvent identity between operators -\ — -g = 



-(B — A)j; with A = I - zH, B = I - z (v\0)(0\ + J2i=iHi\ to S et: 



F(z) = 



1 



1 - zV 
1 

1 - zV 







zH 

8=1 

' fc+1 / 



i- 2^oxo| 



i 



I-zH 



(C2) 



where we used the fact that i?j|0) = 0. One can show in a similar way that ^0 
F(z) = (\ + F(z)z 2 J2 JfFiiz)^ F{z) = - 



zH 



i) = zJiF(z)Fi(z), so that 



(C3) 



Let us now introduce a bivariate generating function, F(z,x) = 1 _ a . 1 p( z ) ■ As a consequence of the recursion equation 
on F(z), one obtains: 



F(z,x) 



E (T 



(C4) 



x (1 - x) s +p +1 ' 

p+s>l 



E 



Pi,---,Pfc+i>0 
PiH hPfc+i=p 



S + J3 

s,_pi, . . . ,p k+1 



fc+1 fc+1 



As u„ = E[[z n x]F(z, x)], we obtain for n > 1 



E 

s,pi,mi,...,p )t+ i,m fe+ i>0 
s+mi+2piH hm k+1 +2p k+1 =n 



s+ p 

S,Pl, ■ ■ ■ ,Pk+l 



i=l i=l 



fc+1 



E[J 2pi ] . . . E[J 2p *+i]E[V s ] Y[ E[[z mi ]Fi(z) pi ] 



(C5) 



i=l 



which proves Eq. (|2"0)l with u(n,q) = E[[z n ]Fi(z) q ] — E[[z n x q ]Fi(z, x)]. In this last expression we introduced the 
bivariate generating function Fi(z,x) — 1 _ X 1 F ■ The proof of Eq. (f2"Tj) follows exactly the same lines, using T k 
instead of (hence the root has only k neighbors), and extracting the coefficient of order x q in the equation 
corresponding to jC4} thanks to the identity [x 9_1 ](l - x)-^ s+ ' p+1 ^ = ( s+ ?t? _1 ). 
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Appendix D: Computation of u(n,q) in the pure case 



In this appendix we explain how to find the solution (|25[) to the recursive equation (|2ip in the pure case. Following 
the interpretation of u(n,q) in terms of walks on U (—1) given in Sec. MI B 11 one realizes that in the pure case 
u°(n, q) is fc"/ 2 times the number of Dyck paths of length n + 2q that start and end at height 1 and visit q times the 
height 0. Following the reflection principle discussed in [5(|, this amounts to count the number of paths from (0,0) 
to (n,q) that reach the height q for the first time at abscissa n, hence u°(n,q) = ( r "/2+q) n+qk n ^ 2 ■ In particular this 

gives back the well-known value of c° = u°(n, 1) as the Catalan number times fc"/ 2 . 

One can also check analytically that this expression for u°(n, q) is indeed the solution of the recursion relation (|2ip . 
The simplest way to do so is to take benefit of the equivalence between (|2"Tj) and the equations on bivariate generating 
functions from which we derived (|2ip in App. [Cj Simplifying them in the pure case, one realizes that the verification 
boils down to prove: 



n/2 + q) n + q \ 2ki 

This equality can be proven 50] by checking it directly for q — 1 and all n, and by showing that both sides l n ^ q and 
fn,q of the equation obeys the same recurrence equation, 

, _ ln+2,q—\ — ln+2 : q-2 _ Tn+2,q-l ~ ?"n+2,g-2 (T>9"i 



Appendix E: Dyck and Motzkin paths of restricted height 



FIG. 10: Left: a Dyck path of length 12 and height 3. Right: a Motzkin path of length 11 and height 3. 

We explain in this appendix the derivation of the properties of Motzkin paths of restricted height that we used 
during the proofs of the lower bounds in Sec. EES and E21 A Motzkin path of length n is a sequence ho, hi, . . . , h n of 
non- negative integers such that h$ = h n = 0, h i+ i — hi e {+1, 0, — 1} for alH £ [0,71—1], see Fig.[TU]for an illustration. 
It is called a Dyck path if there is no horizontal step h i+ i = hi. Its height is the maximum value of hi reached for 
i G [0, n]. Let us assign a weight a (resp. c, 1) to an ascending (resp. horizontal, descending) step, and define the 
weight of a Motzkin path as the product of the weight of its steps. We denote by m n> h, the sum of these weights 
over the paths of length n whose height is smaller or equal to h. The generating function G^- h ' (x) ~ ^2 n>Q m n ^x n is 
known to be [5l|, [HJ : 

^ = 1^^, (ED 
xp h+ i(l/x) 

where the Ph{x) are polynomials of degree h, solution of the recurrence equation: 

Ph+x(x) = (x- c)p h (x) - ap h -x(x) , with p (x) = 1 , pi(x) = x - c . (E2) 

It is convenient to change variables and define the polynomials qh by qh(x) = -^j^ph(2^fax + c). Indeed the recursion 
becomes: 

qh+\{x) = 2xq h (x) - q h -i(x) , with q (x) = 1 , q%(x) = 2x . (E3) 

Hence one recognizes that qh{x) — Uh(x), where Uh is the h-th Chebychev polynomial of the second kind. 

Consider now the rational function Uh(x) /Uh+i(x); it has h+ 1 simple poles located at the roots of I4h+i(x), namely 

Xj (h + 1) = cos ( ^2 ) with j £ [1, h + 1]. Using basic properties of the Chebychev polynomials one easily obtains the 
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following decomposition: 



Uh{x) -yJ-wf^-) \ — -. ( E4 ) 



U h +i{x) + 2 \h + 2J x- XjQi + l) 



Replacing the pn in terms of the Chebychev polynomials in Eq. (jElj) and using the decomposition above leads to an 
explicit expression of the generating function, 

^ M .g^^.(^) _^_^ , (E5) 

and of the coefhcient m nt h after the expansion in powers of x, 

This expression is exact for all n and h. For n even all terms in the sum are positive and one can lowerbound it by 
retaining only the term j = 1 and by using the inequality sin# > 29 /ir for 9 G [0,7r/2]. This yields the bound 

m^>(^) 3 ( C+ 2^cos(^)) , (E7) 

which we used in Sec. IIV Al with a = 1 and c = 0. 

Let us now justify the asymptotic statement on m n /, used in Sec. IV Al . Consider first the behavior of m n ,h for 
c > 0, n — ¥ oo with h fixed. In Eq. (IE6[) the term raised to the power n is maximal for j = 1, and is strictly greater 
in absolute value than all others, hence 

lim — lnm n % = In [ c + 2y / a cos | - — — ) ) . (E8) 
n-s-oo n \ \ h + 2 J J 

Suppose now that h diverges in an ro-dependent way. As long as h <C n 1 / 2 one can check that the terms j > 2 are 
subdominant, and that 

I ln^ = ln( C + 2^-^^ + 0(1) . (E9) 
This is the result used in Sec. IV Al with a = k, c = Vq and h = [alnnj, which indeed satisfied the condition h <§; n 1 / 2 . 



Appendix F: The degeneracy of walks induced by self-bond steps 

This appendix is devoted to an alternative proof a statement used in Sec. IV Bl namely that there are (") walks 
oj of length n with s self-bond steps that share a common underlying walk uj of length n — s, obtained by removing 
the self-bond steps from u>. Let us denote a the common skeleton of length n — s. From Eq. (|22|) one reads the 
combinatorial factor associated to such a skeleton complemented by the numbers {s v } ve \> of self-bond steps on each 
of the vertices: 

~. r 1N ( so + rii + n% -\ \-n k+1 \ -j-r f n v - 1 + s v + n Vl H \-n Vk \ 

V s ,ni,n 2 ,...,n fc+ i / v ^\ V n v - 1, s v ,n Vl , . . . ,n Vh J 

_ fs + ni + n 2 H \-n k+1 \ -q fn v — 1 + s v + n Vl H ^ n Vk \ 
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In the second line we have factored out the combinatorial factor rj(a) of the walk deprived of the self-bond steps. We 
compute now the sum over all possible choices of the {s v } V £y: 



so + ni+m-\ +rik+i\ a \ i r / ( n v — 1 + s v + n Vl + 



e vpm) = vow] e 1 y k+1 y sa ) n e 

(s„}, ev \s =0 \ ' / veV \o\s v =Q 

= 7 ? (a)[z s ](l-z)-( 1+ " 1+ " 2+ -+" fc + 1 ) ]J (1 - (F 



uev\o 



where we have used the combinatorial identity [z p ](l — z) ( a+1 ) = ( p ^ p a ), recognized that for each edge e n e appears 
twice (in the binomial coefficient associated to both its endvertices), and noted that by definition of the length of the 
skeleton 2 X^ev\o Hv = n ~ s - Each of the rj(a) walks w without self-bond steps compatible with a gives thus birth 
to (™) walks with s self-bond steps. 

We shall now explain the proof of the upperbound (|89]l on E(ZJ, s,A), the number of walks to 6 W n that are obtained 
from uJ £ 7Vin-s by adding to it s self-bond steps, in such a way that the self-support of cj is A. This number is 
smaller than the number of walks uj with a self-support included in A (not necessarily equal to „4) . This last quantity 
can be computed as 

1 „ /s - 1 + 1(0 e.A)(l + m + ■■■ + n k +i)+ E {n v + n Vl + ■ ■ ■ + n Vh )\ 

-—- 2^ ri(a,{s v })=[ veA\o ), (F4) 

where we denoted I(M) the characteristic function of the event M, and obtained the right-hand side with a reasoning 
similar to the one in (|F3j) . One can trade the sum over vertices for a sum over the edges of the support by introducing 
the numbers d e (A) € {0, 1, 2} which count the number of endvertices of the edge e which belongs to A, and rewrite 
the last quantity as 

n-(I(Q^A)+T,n e m2-d e (A)))\ 

eGer I . [t 5) 



Finally is obtained by using the facts that n e (uj) > 1 for all edges in the support and that each vertex of A has 
at most degree k + 1 in the support. 
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